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Description 

FIELD OF THE INVENTION 

5 [0001] This invention relates generally to the assessment of nucleic acids in human or animal tissue samples. More 
particularly, the invention relates to the simultaneous measurement in tissue samples of gene expression and of chro- 
mosome abnormalities. 

BACKGROUND OF THE INVENTION 

10 

[0002] Abnormalities in the expression of genes, both in the timing and level of expression of particular genes, are a 
fundamental cause of cancer and other human disease. Abnormalities in genomic DNA, i.e. in chromosomes, are also 
a fundamental cause of cancer and other human disease, often leading to the over-expression or under-expression of 
genes. Some chromosomal abnormalities, such as balanced translocations and inversions between chromosomes, and 

15 base pair changes, do not involve a change in DNA sequence copy number. Other genomic DNA abnormalities comprise 
changes in DNA sequence copy number from the normal one copy per chromosome. These genomic DNA abnormalities 
often are referred to as gene amplification for copy number increase and gene deletion for copy number decrease. For 
example, one aggressive form of breast cancer, occurring in about 25-30% of breast cancers, results from the gene 
amplification and over-expression of the Her-2/neu oncogene, which is located on chromosome 17 at band q12. Breast 

20 cancer patients with this genetic abnormality have a significantly poorer prognosis, both for overall survival and disease- 
free survival, than patients without this abnormality. In addition, over-expression of the Her-2 gene occurs, in the absence 
of gene amplification of the chromosomal locus of the gene, at an earlier, less aggressive stage of the disease, Borg, 
et al., "Her-2/neu Activity in Human Breast Cancer," Cancer Research 50, 4332-4337 (July 15, 1990). Proper assessment 
and management of breast cancer thus requires tests to measure the presence of Her-2 gene expression and Her-2 

25 gene chromosomal copy number. 

[0003] Chromosomal abnormalities such as Her-2 gene copy number can be assessed by assays using fluorescent 
in situ hybridization ("FISH"). FISH assays involve hybridization of DNA probes to chromosomal DNA present in mor- 
phologically intact metaphase spreads or interphase cells of tissue samples. The U.S. Food and Drug Administration 
recently approved a diagnostic FISH test, PathVysion™ Her-2, available from Vysis, Inc. (Downers Grove, Illinois) for 

30 detection of Her-2 copy number and prediction of outcome of adriamycin therapy in node positive breast cancer patients. 
[0004] Cancer also involves abnormalities in multiple genes, leading to multiple forms of the disease, as exemplified 
by breast cancer, wherein the Her-2 oncogene is not abnormal in the majority of cases. So-called "DNA Chip" or "micro- 
array" tests using hybridization to a two dimensional array of multiple nucleic acid probes attached to a solid substrate 
assess multiple gene expression abnormalities simultaneously. See for example, U.S. Patents 5,445,934, "Array of 

35 Oligonucleotides on Solid Substrate," Fodor, et al., 5,800,992, "Method of Detecting Nucleic Acids," Fodor, et al., and 
5,807,552, "Methods for Fabricating Microarrays of Biological Substances," Brown, et at. The microarray gene expression 
tests are of growing use in the development of new drugs targeted at particular diseases. 

[0005] Multiple gene expression at the protein level also can be examined by the use of "microdot" immunoassays, 
which are two dimensional arrays of immobilized antigens on a substrate. See U.S. Patent 5,486,452, "Devices and Kits 

40 for Immunological Analysis," Gordon, et al., priority date February 3, 1982, and Ekins, et al, Analytica Chimica Acta, 
227:73-96 (1989). The immobilized antigens of Gordon, et al. include nucleic acids and are disclosed as arrayed at 
densities of 10 5 per 10 square centimeters (or 1,000 per cm 2 ). Gordon, et al. further disclose the array has "intrinsic 
resolution" below the size of pipetting devices common in 1982, see Gordon, et al. at column 17, and can thus contain 
antigens at higher densities. Gordon, et al. disclose that the arrays can be manufactured by use of mechanical transfer 

45 apparatus, miniaturized applicators, lithographic procedures or high speed electronic printing. 

[0006] U.S. Patent 5,665,549, "Comparative Genomic Hybridization (CGH)," Pinkel, et at., discloses a method for 
simultaneous assessment of multiple genetic abnormalities. CGH involves the comparative, multi-color hybridization of 
a reference nucleic acid population labeled in one fluorescent color and a sample nucleic acid population labeled in a 
second fluorescent color to all or part of a reference genome, such as a human metaphase chromosome spread. 

so Comparison of the resulting fluorescence intensity at locations in the reference genome permits determination of copy 
number of chromosomal sequences, or of expressed gene sequences, in the sample population. Microarray-based CGH 
tests have also been disclosed for the assessment of multiple genomic DNA or gene expression abnormalities, see US 
Patent 5,830,645 (WO 96/17958), "Comparative Fluorescent Hybridization to Nucleic Acid Arrays, Pinkel, et al.; co- 
pending and commonly assigned U.S. Patent Application Serial Number09/085,625, "Improvements of Biological Assays 

55 for Analyte Detection," Muller, et al.; and Pinkel, et al., "High resolution analysis of DNA copy number variation using 
comparative genomic hybridization to microarrays," Nature Genetics, Vol. 20, Oct. 1998, pp. 207-211. Pinkel, et al. in 
Nature Genetics disclose the capability of CGH to a microarray target to detect a single copy change in genomic DNA. 
[0007] To date, assessment of gene expression and of chromosomal abnormalities requires separate tests on a tissue 
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sample, leading to extra sample processing and reagent costs. Separate testing for gene expression and chromosomal 
abnormalities can also require more tissue than is available. The prior art does not disclose simultaneous measurement 
of gene expression and chromosomal abnormalities with a multi-color hybridization to a microarray. It is an object of this 
invention to circumvent separate testing by performing simultaneous testing for gene expression and chromosomal 
5 abnormalities on a tissue sample. It is another object to simultaneously test gene expression and chromosomal abnor- 
malities on a single nucleic acid microarray. Other objects of the invention will be detailed below. 

SUMMARY OF THE INVENTION 

w [0008] The invention comprises a multi-color, comparative hybridization assay method using an array of nucleic acid 
target elements attached to a solid support for the simultaneous detection of both gene expression and chromosomal 
abnormalities in a tissue sample. The method of the invention employs a comparative hybridization of a tissue mRNA 
or cDNA sample labeled with a first detectable marker, a tissue genomic DNA sample labeled with a second detectable 
marker, and at least one reference nucleic acid labeled with a third detectable marker, to the array. Each marker's 

15 presence and intensity at each target element is detected and the ratios of the markers, for example, (1 ) of the first and 
third markers and (2) the second and third markers, are determined for each of the target elements. Gene expression 
and chromosomal abnormalities are thus simultaneously detected by analysis of the marker ratios. In a preferred em- 
bodiment, the markers are each fluorescent labels. Thus, in a first aspect, the present invention provides a method for 
simultaneous detection of gene expression and chromosomal abnormality in a tissue sample comprising: 

20 

(a) providing an array of nucleic acid target elements attached to a solid support wherein the nucleic acid target 
elements comprise polynucleotide sequences substantially complementary under preselected hybridisation condi- 
tions to nucleic acids indicative of gene expression and of chromosomal sequence of a tissue sample; 

(b) providing at least three labelled nucleic acid populations: 

25 

(i) a mRNA or cDNA population labelled with a first marker and derived from the tissue sample, 

(ii) a chromosomal DNA population labelled with a second marker and derived from the tissue sample, and 

(iii) at least one reference nucleic acid population labelled with a third marker; 

30 (c) contacting the array with the labelled nucleic acid populations under hybridisation conditions; and 

(d) detecting presence and intensity of each of the first, second and third markers to at least two target elements. 

[0009] The present invention also provides a method for simultaneous detection of gene expression and chromosomal 
abnormality in a tissue sample comprising: 

35 

(a) providing an array of nucleic acid target elements attached to a solid support wherein the nucleic acid target 
elements comprise polynucleotide sequences substantially complementary under preselected hybridisation condi- 
tions to nucleic acids indicative of gene expression and of chromosomal sequence of a tissue sample; 

(b) providing at least three labelled nucleic acid populations: 

40 

(i) a mRNA or cDNA population labelled with a first fluorescent colour and derived form the tissue sample, 

(ii) a chromosomal DNA population labelled with a second fluorescent colour and derived from the tissue sample, 
and 

(iii) at least one reference nucleic acid population labelled with a third fluorescent colour; 

45 

(c) contacting the array with the labelled nucleic acid populations under hybridisation conditions; and (d) detecting 
presence and intensity of each of the first, second and third fluorescent colours to at least two target elements. 
The present invention also provides a method of for simultaneous detection of gene expression and chromosomal 
abnormality in a tissue sample comprising: 

50 

(a) providing an array of nucleic acid target elements comprising genomic DNA attached to a solid support 
wherein the nucleic acid target elements comprise polynucleotide sequences substantially complementary under 
preselected hybridisation conditions to nucleic acids indicative of gene expression and of chromosomal se- 
quence of a tissue sample; 
55 (b) providing at least three labelled nucleic acid populations: 

(i) a mRNA or cDNA population labelled with a first fluorescent colour and derived form the tissue sample, 

(ii) a chromosomal DNA population labelled with a second fluorescent colour and derived from the tissue 
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sample, and 

- (iii) at least one reference nucleic acid population labelled with a third fluorescent colour; 

(c) contacting the array with the labelled nucleic acid populations under hybridisation conditions; and 
5 (d) detecting presence and intensity of each of the first, second and third fluorescent colours to at least two 

target elements. 

The invention has broad utility in human disease management by providing more complete genetic assessment 
10 data to guide therapy selection, in human and animal drug development programs by assessing therapeutic candidate 

effects, and in bacterial and viral pathogen diagnosis. Particular cancers, which are characterized by gene amplifi- 
cation coupled with over-expression of the mRNA for the amplified gene, may be more aggressive diseases and 
need more aggressive therapies. The mechanism that drives over-expression could be fundamental in understanding 
what therapeutic interventions may be appropriate. Thus, the characterization of both gene expression and ampli- 
15 fication by the methods of the invention can lead to improved cancer therapy. 

In a preferred embodiment, the invention comprises a method for simultaneous detection of gene expression and 
chromosomal abnormality in a tissue sample comprising: 

(a) providing a microarray of nucleic acid target elements attached to a solid support wherein the nucleic acid 
20 target elements comprise polynucleotide sequences substantially complementary under preselected hybridi- 
zation conditions to nucleic acids present in a tissue sample, which are indicative of gene expression and 
indicative of chromosomal sequence; 

(b) providing at least three labeled probe nucleic acid populations: 

25 (j) a cDNA population labeled in a first fluorescent color and derived from mRNA from the tissue sample, 

(ii) a chromosomal DNA population labeled in a second fluorescent color and derived from the tissue sample, and 

(iii) at least one reference nucleic acid population labeled in a third fluorescent color; 

(c) contacting the microarray with the labeled nucleic acid populations under hybridization conditions; and 
30 (d) detecting presence and intensity of each of the first, second and third fluorescent label colors on at least two 

target elements. 

[0010] Measurement and comparison of hybridization of message, genomic and reference nucleic acids at the same 
target elements provides the simultaneous assessment of expression and genomic changes. The invention also com- 

35 prises use of multiple reference nucleic acids, for example, a genomic reference DNA labeled in the third fluorescent 
color and a reference cDNA population labeled in a fourth fluorescent color. The nucleic acid target elements can be 
either genomic DNA, oligomer DNA or cDNA. A preferred embodiment comprises an array with a mixture of genomic 
DNA target elements and oligomer DNA or cDNA target elements, with the oligomer DNA/cDNA targets measuring 
expression and the genomic DNA targets measuring chromosomal change. It is also preferred to use a microarray having 

io a target element density capable of measuring 1 ,000 different gene and genomic loci in less than one square centimeter 
of chip surface. 

BRIEF DESCRIPTION OF THE DRAWINGS 
45 [0011] 

Figures 1(a) through 1 (e) depict the components of a preferred hybridization cartridge for use in performing the 
inventive methods. 

Figures 2(a) through 2(h) depict data from a nucleic acid microarray after hybridization with tissue cDNA and genomic 
so DNA populations, each derived from a human cancer cell line, one labeled red, the other green, and a total human 

genomic DNA reference population labeled orange, which show the capability of the method of the invention to 
detect simultaneously both gene expression and chromosomal abnormalities on the same nucleic acid microarray. 

DETAILED DESCRIPTION OF THE INVENTION 

55 

(1) Definitions 

[0012] The following abbreviations are used herein: 
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bp - base pair 

CGH - Comparative Genomic Hybridization 
DAPI - 4, 6 diamidino-2-phenylindole 
dCTP - deoxycytosine triphosphate 
s DNA - deoxyribonucleic acid (in either single- or double-stranded form, including analogs that can function in a 

similar manner) 

dUTP - deoxyuridine triphosphate 
FISH - fluorescence in situ hybridization 
kb - kilobase 
10 mm - millimeter 

mRNA - messenger RNA 
ng - nanogram 
nl - nanoliter 

RNA - ribonucleic acid in either single- or double-stranded form, including analogs that can function in a similar manner 
15 |xg - microgram 

jjlI - microliter 
ixm - micrometer 
p,M - micromole 



20 [0013] The term "nucleic acid" or "nucleic acid molecule" refer to a deoxyribonucleotide or ribonucleotide polymer in 
either single- or double-stranded form, including known analogs of natural nucleotides that can function in a similar 
manner as naturally occurring nucleotides. 

[0014] The term "exon" refers to any segment of an interrupted gene that is represented in the mature mRNA product. 
Some protein coding genes do have exons that are non-coding, e.g., exon 1 of the human c-myc gene. Perhaps all 
25 protein coding genes have first and last exons that are partially coding. 

[001 5] The terms "single copy sequence" or "unique sequence" refer to a nucleic acid sequence that is typically present 
only once per haploid genome, such as the coding exon sequences of a gene. 

[0016] The term "complexity" is used herein according to standard meaning of this term as established by Britten, et 
a!., Methods ofEnzymoL, 29:363 (1974). See also Cantor and Schimmel, Biophysical Chemistry: Part ///at 1228-1230, 
30 for further explanation of nucleic acid complexity. 

[001 7] The term "target element" refers to a region of a substrate surface that contains immobilized or attached nucleic 
acids capable of hybridization to nucleic acids isolated from a tissue sample. 

[001 8] "Bind(s) substantially" refers to complementary hybridization between a tissue nucleic acid and a target element 
nucleic acid and embraces minor mismatches that can be accommodated by reducing the stringency of the hybridization 

35 media to achieve the desired detection of the tissue polynucleotide sequence. 

[001 9] The terms "specific hybridization" or "specifically hybridizes with" refers to hybridization in which a tissue nucleic 
acid binds substantially to target element nucleic acid and does not bind substantially to other nucleic acids in the array 
under defined stringency conditions. One of skill will recognize that relaxing the stringency of the hybridizing conditions 
will allow sequence mismatches to be tolerated. The degree of mismatch tolerated can be controlled by suitable adjust- 

40 ment of the hybridization conditions. 

[0020] One of skill will also recognize that the precise sequence of the particular nucleic acids described herein can 
be modified to a certain degree to produce tissue nucleic acid probes or target element nucleic acids that are "substantially 
identical" to others, and retain the ability to bind substantially to a complementary nucleic acid. Such modifications are 
specifically covered by reference to individual sequences herein. The term "substantial identity" of polynucleotide se- 

45 quences means that a polynucleotide comprises a sequence that has at least 90% sequence identity, and more preferably 
at least 95%, compared to a reference sequence using the methods described below using standard parameters. 
[0021] Two nucleic acid sequences are said to be "identical" if the sequence of nucleotides in the two sequences is 
the same when aligned for maximum correspondence as described below. The term "complementary to" is used herein 
to mean that the complementary sequence is complementary to all or a portion of a reference polynucleotide sequence. 

so [0022] Sequence comparisons between two (or more) polynucleotides are typically performed by comparing sequenc- 
es of the two sequences over a "comparison window" to identify and compare local regions of sequence similarity. A 
"comparison window," as used herein, refers to a segment of at least about 20 contiguous positions, usually about 50 
to about 200, more usually about 100 to 150, in which a sequence may be compared to a reference sequence of the 
same number of contiguous positions after the two sequences are optimally aligned. 

55 [0023] Optimal alignment of sequences for comparison may be conducted by the local homology algorithm of Smith 
and Waterman, Adv. Appl. Math. 2:482 (1981), by the homology alignment algorithm of Needleman and Wunsch, J. 
Mol. Biol. 48:443 (1970), by the search for similarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. (U.S.A.) 85: 
2444 (1988), and by computerized implementations of these algorithms. 
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[0024] "Percentage of sequence identity" is determined by comparing two optimally aligned sequences over a com- 
parison window, wherein the portion of the polynucleotide sequence in the comparison window may comprise additions 
or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise additions or deletions) for 
optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which 
5 the identical nucleic acid base occurs in both sequences to yield the number of matched positions, dividing the number 
of matched positions by the total number of positions in the window of comparison and multiplying the result by 100 to 
yield the percentage of sequence identity. 

[0025] Another indication that nucleotide sequences are substantially identical is if two molecules hybridize to the 
same sequence under stringent conditions. Stringent conditions are sequence dependent and will be different in different 

10 circumstances. Generally, stringent conditions are selected to be about 5° to about 25° C. lower than the thermal melting 
point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic 
strength and pH) at which the strands of a DNA duplex or RNA-DNA hybrid are half dissociated or denatured. 
[0026] As used herein, a "probe" is defined as a population or collection of tissue nucleic acid molecules (either RNA 
or DNA) capable of binding to a target element comprising nucleic acid of complementary sequence through one or 

15 more types of chemical bonds, usually through hydrogen bond formation. The probe populations are directly or indirectly 
labeled as described below. The probe populations are typically of high complexity, for instance, being prepared from 
total genomic DNA or total mRNA isolated from a tissue cell or tissue cell population. 

(2) Overview 

20 

[0027] The methods of the invention combine the capability of assessment of a large number of nucleic acids provided 
by microarray test formats with the multi-color, comparative hybridization power of CGH to assess simultaneously both 
gene expression and genomic abnormalities in the same tissue sample. The methods of the invention employ hybridization 
under suitable hybridization conditions to a nucleic acid array comprising multiple nucleic acid target elements of nucleic 

25 acid populations derived from a tissue sample. The nucleic acid target elements comprise either genomic DNA, oligomer 
or cDNA nucleic acids complementary to expressed gene sequences, or a mixture of the two. The nucleic acid populations 
are separately labeled with different detectable markers and comprise (1) a mixture of mRNA or its complementary 
cDNA, which is representative of gene expression in the tissue sample, and (2) a mixture of genomic DNA, which is 
representative of the genomic status of the tissue sample. The labeled nucleic acid populations are cohybridized to the 

30 array with one or more reference nucleic acid populations, with each reference population also labeled with its own 
different detectable marker. Preferably, all of the nucleic acid populations applied to the array are each labeled with 
different fluorescent markers. The reference nucleic acid or nucleic acids is or are chosen to permit assessment of the 
gene expression state and genomic state of the tissue sample relative to the reference or references. After a suitable 
hybridization time, the fluorescent color presence and intensity are detected at each target element of the array. Com- 

35 parison of the fluorescent ratios between colors at a particular target element provides measurement of the copy number 
for genomic DNA sequences and for cDNA sequences, which are complementary to that target element. 
[0028] A genomic DNA sequence generally contains both one or more "exon" sequences, which code for all or part 
of the RNA expressed gene sequence, and one or more "intron," non-coding sequences, which also often contain repeat 
sequences replicated at many points in the human genome. A genomic target element can thus serve as a hybridization 

<o target for the expressed gene sequences that map to the particular genomic sequence. Similarly, a target element 
complementary to a particular expressed gene sequence is also complementary to the exon sequences of genomic 
DNA. Hence, a genomic DNA target element and a cDNA target element can each be used in an array format for 
hybridization to either genomic DNA or expressed gene sequence nucleic acids. The array format used in the methods 
of the invention comprises a microarray of separate nucleic acid target elements each complementary to (1 ) a particular 

45 genomic DNA sequence or (2) a particular expressed gene sequence. A mixture of target elements comprising some 
target elements complementary to (1 ) and some complementary to (2) can also be used. 

[0029] A significant advantage of the methods of the invention is the simultaneous determination of both gene expres- 
sion and chromosomal abnormality. Some aggressive, virulent forms of cancer are characterized by both over-expression 
of one or more oncogenes and gene amplification of the chromosomal locus of each oncogene, such as breast cancer 

so involving Her-2. Testing for over-expression of the oncogene alone is inadequate for the complete characterization of 
the disease state. Simultaneous testing of the same tissue sample for both gene expression and chromosomal abnor- 
malities with the methods of the invention thus advantageously identifies both over-expression and the molecular causes 
of over-expression and thereby enables appropriate prognostic assessment and therapy selection. 
[0030] The choice of genomic, cDNA or a mixture of target elements can vary with the tissue and analysis sought. 

55 For example, cDNA target elements are advantageous because the effect of repeat sequences present in some genomic 
DNAs is decreased and more precise detection of expressed genes is possible. Genomic DNA target elements are 
advantageous because the higher complexities can produce greater signal. A mixture of genomic DNA and cDNA target 
elements can also be used to provide more detailed genomic and expression analysis. 
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(3) Nucleic Acids in the Target Elements 

[0031] The nucleic acid sequences of the target elements can comprise any type of nucleic acid or nucleic acid analog, 
including without limitation, RNA, DNA, peptide nucleic acids or mixtures thereof, and can be present as clones also 
s comprising vector sequences or can be substantially pure. Arrays comprising peptide nucleic acids are disclosed in U.S. 
Patent 5,821,060, "DNA Sequencing, Mapping and Diagnostic Procedures Using Hybridization Chips and Unlabeled 
DNA," 

H. Arlinghaus, et al. 

10 

[0032] The nucleic acids of a target element typically have their origin in a defined region of a selected genome (for 
example a clone or several contiguous clones from a human or animal genomic library), or correspond to a functional 
genetic unit of a selected genome, which may or may not be complete (for example a full or partial cDNA sequence). 
The target nucleic acids can also comprise inter-Alu or Degenerate Oligonucleotide Primer PCR products derived from 
15 cloned DNA. 

[0033] The nucleic acids of a target element can, for example, contain specific genes or be from a chromosomal region 
suspected of being present at increased or decreased copy number in cells of interest, e.g., tumor cells. For example, 
separate target elements can comprise DNA complementary to each of the oncogene loci listed in Table 2 below. The 
target element may also contain an mRNA or cDNA derived from such mRNA, suspected of being transcribed at abnormal 

20 levels, for example, expressed genes mapping to the gene loci in Table 2 below. 

[0034] Alternatively, a target element may comprise nucleic acids of unknown significance or location. An array of 
such elements could represent locations that sample, either continuously or at discrete points, any desired portion of a 
genome, including, but not limited to, an entire genome, a single chromosome, or a portion of a chromosome. The 
number of target elements and the complexity of the nucleic acids in each would determine the density of analysis. For 

25 example, an array of 300 target elements, with each target containing DNA from a different genomic clone, could sample, 
i.e., analyze, the entire human genome at 1 0 megabase intervals. An array of 3,000 target elements, with each containing 
100 kb of genomic DNA, could give substantially complete coverage at one megabase intervals of the unique sequence 
regions of the human genome. Similarly, an array of target elements comprising nucleic acids from anonymous cDNA 
clones or complementary to Expressed Sequence Tags ("ESTs") would permit identification of those expressed gene 

30 sequences that might be differently expressed in some cells of interest, thereby focusing attention on study of these 
genes or identification of expression abnormalities for diagnosis. 

[0035] One of skill will recognize that each target element can comprise a mixture of target nucleic acids of different 
lengths and sequences. A target element will generally contain more than one copy of a cloned or synthesized piece of 
DNA, and each copy can be broken into fragments of different lengths. The length and complexity of the target element 
35 sequences of the invention is not critical to the invention. One of skill can adjust these factors to provide optimum 
hybridization and signal production for a given hybridization procedure, and to provide the required resolution among 
different genes or genomic locations. 

[0036] The target elements can comprise oligomers, such as those in the range of 8 to about 100 bp, preferably 20 
to 80 bp, and more preferably about 40 to about 60 bp, which can be readily synthesized using widely available synthesizer 

40 machines. Oligomers in target elements can also be synthesized in situ on the array substrate by any methods, such 
as those known in the art. The oligomer sequence information can be obtained from any convenient source, including 
nucleic acid sequence data banks, such as GENBANK, commercial databases such as LIFESEQ from Incyte Pharma- 
ceuticals, Inc. (Palo Alto, California), or EST data such as that produced by use of SAGE (serial analysis of gene 
expression). For oligomer or partial cDNA elements, one need only synthesize a partial sequence complementary to a 

45 part of the mRNA for the gene or complementary to an identifiable, critical sequence for the gene (critical in the sense 
of the sequences coding for the functional parts of the expressed protein, i.e., of the receptor binding site). 
[0037] The target elements can comprise partial or full-length cDNA sequences, either synthesized for smaller cDNAs 
or cloned, preferably having a complexity in the range of about 100 bp to about 5,000 bp. cDNA target elements can be 
readily obtained from expressed gene sequence cDNA libraries from a desired tissue, which are produced using con- 

50 ventional methods or obtained from commercial sources, such as the libraries maintained by Genome Systems, Inc. 
(St. Louis, Missouri), Research Genetics (Huntsville, Alabama) and Clonetech (South San Francisco, California). 
[0038] The target elements can comprise genomic DNA sequences of any complexity, but generally of a complexity 
of about 20,000 bp to about 250,000 bp, and preferably about 50,000 bp to about 175,000 bp. Genomic DNA can be 
obtained from any mapped genomic clones produced by standard cloning procedures or obtained from commercial 

55 sources, such as the chromosome specific libraries maintained by the American Type Culture Collection (Rockville, 
Maryland), hereinafter ATCC. A preferred genomic library source is the human DNA BAC library maintained by Genome 
Systems. 

[0039] The identification of genomic DNA or cDNA selected for use in the target elements can be determined by the 
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location of chromosomal sequences known or identified as amplified or deleted or of genes over- or under-expressed. 
The identification of genomic or cDNA clones is done by designing primer sequence pairs using, for example, genetic 
data in Gene Map '98 maintained by the U. S. National Institute of Health or the Genome Data Base at 
[0040] http://gdbwww.gdb.org/gdbtop.html. For example, the Her-2 gene is believed to comprise about 40 kb of ge- 
s nomic sequence and a PCR primer pair can be designed based upon the published Her-2 sequence. The PCR primer 
pair or the PCR amplicon product can then be used to screen a genomic DNA library to identify clones containing 
complementary sequences. The genomic DNA clones identified in the screen can be used on an array in the method 
of the invention to identify genomic abnormality at the Her-2 locus. 

[0041] For use of arrays that detect viruses and viral gene expression simultaneously with detection of human genetic 
10 abnormalities, the target elements can comprise sequences complementary to known or identified viral sequences. The 
array target elements can also be designed to detect viral integration sites in the human or an animal genome. Use of 
such a pathogen array is medically significant, for example, because of the known ties of human papilloma virus to 
human cervical cancer and h. pylori to human gastrointestinal cancer. Similarly, known bacterial gene sequences can 
be used to design the nucleic acids of the target elements. Use of pathogen sequence based arrays also can be used 
15 in food and environmental testing. 

(4) Target Elements 

[0042] The target elements can be of varying dimension, shape and area. The target elements can comprise physically 
20 separated spots produced by printing methods, for example, mechanical transfer, gravure, ink jet or imprint methods. 
The target elements also can be closely abutted such as those produced by the photolithographic in situ array synthesis 
of U.S. Patent 5,445,934. The target elements are preferably generally round in shape on a planar surface. Generally, 
smaller elements are preferred, with a typical target element comprising less than 500 microns in diameter. Particularly 
preferred target element sizes are between about 5 microns and 250 microns in diameter to achieve high density. 
25 [0043] The target element density can be any desired density and is preferably one typical of nucleic acid microarrays, 
i.e. greater than about 100 target elements per square centimeter. For the preferred use in human disease management, 
the target element density is preferably in the range of about 100 to about 10,000 target elements per square centimeter 
of chip surface. Higher or lower densities can be desirable and higher densities can be preferred for use in drug devel- 
opment to permit examination of higher numbers of expressed gene sequences. 

30 

(5) Array Manufacture 

[0044] The microarray can be manufactured in any desired manner and both robotic deposition and synthesis in situ 
methods for array manufacturing are known. See for example, U.S. Patents 5,486,452, 5,830,645, 5,807,552, 5,800,992 

35 and 5,445,934. It is preferred to manufacture the microarray using a robotic deposition method and apparatus, which 
employs robotic deposition of nucleic acids through a capillary needle or pin as disclosed in co-pending, commonly 
assigned U.S. Patent Application Serial Number 09/085,625, filed May 27, 1998, "Improvements of Biological Assays 
for Analyte Detection," Muller, et al. (hereinafter "Muller, et al. M ), to produce a two dimensional microarray of physically 
separated or "spotted" target elements immobilized in rows and columns on a chromium coated-substrate. 

40 [0045] A robotic applicator with multiple capillary needles can be used. A single needle applicator using a pin which 
is washed between applications of different nucleic acids, or using a robotic pin changer also can be used. The needle 
used is preferably a 33 gauge, one-inch long stainless steel capillary syringe needle. The needle is connected to a 
nucleic acid reservoir, preferably a Luer lock syringe tip. A preferred needle and reservoir is available commercially from 
EFD, (East Providence, Rhode Island). It is preferred to use multiple capillary needles, each depositing a different nucleic 

45 acid, thereby eliminating a washing step between depositions. 

[0046] Any suitable amount of nucleic acid is deposited in each target element, with the target element size dependent 
on the amount deposited. For each target element, the amount can be from about 0.05 nl to about 5.0 nl of a nucleic 
acid solution of 1 p.g/|j.l nucleic acid concentration. For a density of 1,000 target elements/cm 2 , the individual amount 
deposited per target element is about 0.2 nl to about 2.0 nl of 1 p.g/p.1 solution. The nucleic acid is provided in any solvent 

50 that will permit deposition of denatured nucleic acid. Preferably, the nucleic acid is provided in 100 mM NaOH at 1 jtg/ 
fjil concentration. 

[0047] To assist robotic manufacturing, automated tracking and labeling methods and apparatus can be used, for 
example, in delivering the correct nucleic acid for deposition at a particular target element. For example, bar coding or 
transponder labeling or tracking of capillary pins containing different nucleic acids are useful to assure delivery of the 
55 correct nucleic acid to the desired target element. The use of bar coding or transponder labeling also permits better 
computer control of the manufacturing process. 

[0048] A microarray comprising both cDNA and genomic DNA target elements can be produced in any arrangement. 
For example, the cDNA elements can be located in one portion of the array or can be interspersed among the genomic 
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DNA target elements. Although the regularity of a two dimensional array on a planar substrate surface is preferred to 
permit easy fluorescence detection and analysis, the array can be manufactured in any desired configuration. 
[0049] Individual target elements can appear only once or can be replicated to provide statistical power to analysis of 
results. For arrays with densities under 3,000 target elements per cm 2 , it is preferred to manufacture the array so that 
s each target element is replicated three times on the array, to provide better calibration of the results. Applicants have 
determined that when using a microarray of less than one cm 2 of substrate surface area, the replicates can be placed 
adjacent each other or separated without material effect on the results. 

[0050] Preferably, individual microarrays are manufactured on a large, substrate plate or wafer, which is scored using 
procedures well known in the semiconductor industry for breakup into individual chips. Chromium-coated glass plates 

10 or wafers are available commercially from Nanofilm (Westlake Village, California) and can be scored using conventional 
procedures. Thus, multiple chips can be manufactured at once on the same wafer with one robotic applicator, and then 
separated into individual chips. Before printing, the wafers are preferably washed using, in order, distilled water, isopro- 
panol, methanol and distilled water washes. Nitrogen is used to blow-off excess water and the rinsed wafers are dried. 
[0051] The preferred Muller, et al. apparatus uses X-Y and Z axis controllers for the capillary pin applicator with 

15 application of a burst of low air pressure to deposit each nucleic acid. It is further preferred to use a suitable Z-axis 
controller on the apparatus of Muller, et al. to avoid contact of the capillary pin with the substrate surface. Positioning 
the pin above the surface , preferably about 1 00 jim above, permits better spot size regularity and use of lower air pressure. 
[0052] When beginning printing, the plate or wafer is equilibrated to room temperature. The Z-axis height of each chip 
is then determined for use by the robot controller. Preferably, the printing starts with deposition of a y, diameter "marker" 

20 spot in one corner of each chip for alignment control. The nitrogen pressure is low, preferably about 1 psi or less, and 
is a pressure sufficient to deposit the particular nucleic acid given its viscosity and amount to be deposited. The nitrogen 
pulse length is generally about 10 milliseconds 

[0053] It is also preferred to include various control target elements such as, for example, target elements comprising: 
(1) total genomic DNA, (2) vector DNA, (3) a pooled mixture of genomic DNA or cDNA from each target element, (4) 
25 total RNA from a normal tissue, or (5) total genomic or cDNA from a tissue with known abnormalities. The control target 
elements can also include a series of target elements each comprising a nucleic acid of known copy number for a 
particular expressed gene or genomic sequence. For example, genomic DNA extracted from cell lines with 1, 2, 3, 4 
and 5 copies of the human X chromosomes can be used. 

[0054] For quality control of the preferred robotic deposition manufacturing, it is preferred to image the produced arrays 
30 using a stereo microscope and a CCD camera. An image of each chip is captured and analyzed. Chips with missing, 
missized or misshaped target elements are identified and marked. 

[0055] When using cloned cDNA or cloned genomic DNA, the vector sequences can be removed before deposition 
with any suitable process or retained if they do not significantly interfere with the hybridization. For cloned genomic DNA 
and cDNA, it is preferred to not remove the vector sequences. 

35 [0056] Any suitable substrate can be used, including those disclosed in U.S. Patent 5,445,934 and 5,807,552. The 
substrate can be for example, without limitation, glass, plastics such as polystyrene, polyethylene, polycarbonate, polysul- 
fone and polyester, metals such as chromium and copper, metal coated substrates and filters of any material. The 
substrate surface bearing the immobilized nucleic acids is preferably planar, but any desired surface can be used 
including, for example, a substrate having ridges or grooves to separate the array target elements. The nucleic acids 

40 can also be attached to beads, which are separately identifiable. The planar chromium-coated glass substrate of Muller, 
et al. is preferred. 

[0057] The nucleic acids of the target elements can be attached to the substrate in any suitable manner that makes 
them available for hybridization, including covalent or non-covalent binding. The non-covalent attachment method of 
Muller, et al. is preferred. 

45 

(6) Tissue Nucleic Acids 

[0058] The nucleic acid populations can be derived from any tissue source, including human, plant and animal tissue. 
The tissue sample comprises any tissue, including a newly obtained sample, a frozen sample, a biopsy sample, a blood 

50 sample, an amniocentesis sample, preserved tissue such as a paraffin-embedded fixed tissue sample (i.e., a tissue 
block), or a cell culture. Thus, the tissue sample can comprise a whole blood sample, a skin sample, epithelial cells, soft 
tissue cells, fetal cells, amniocytes, lymphocytes, granulocytes, suspected tumor cells, organ tissue, blastomeres and 
polar bodies. The tissue to be tested can be derived from a microdissection process to produce a more homogeneous 
cell population. Paraffin fixed tissue is pre-treated with any suitable process to remove the wax, and a paraffin pretreatment 

55 kit is available commercially from Vysis, Inc. Any suitable amount of tissue can be used, including a single cell, such as 
a human blastomere cell to be tested during in vitro fertilization procedures. Where only one or a few cells are available, 
such as when testing human fetal cells separated from maternal blood samples, a nucleic acid amplification technique 
to amplify the amount of nucleic acid can be used. 
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[0059] The nucleic acid populations derived from the tissue are produced by any suitable nucleic acid separation or 
purification process. Nucleic acid separation methods for both genomic DNA and for messenger RNA are available 
commercially, such as the QIAamp tissue kit for DNA isolation from Qiagen. For example, mRNA can be extracted from 
the tissue and then converted to cDNA by treatment with reverse transcriptase. If insufficient cDNA is available, the 
5 cDNA can be amplified by polymerase chain reaction. This well known process is called RT/PCR. It is also possible to 
convert the cDNA into a complementary RNA ("cRNA"). 

[0060] In general, where greater than about one million cells of tissue are available, the tissue nucleic acids can be 
extracted and used without amplification. If less than about one million cells are available, a nucleic acid amplification 
or concentration is preferably used. Preferably, such an amplification technique is PCR. Care and appropriate controls 
10 should be used with PCR to avoid or identify any artefacts introduced. 

(7) Reference Nucleic Acids 

[0061] The reference nucleic acid population is any suitable nucleic acid collection chosen to serve as a reference. 
15 For example, the reference population can be total human genomic DNA from normal tissue, total mRNA extracted from 
a normal sample of the tissue to be tested and converted to cDNA, or a synthetic or naturally-occurring mixture of cDNA 
for particular expressed genes. The reference can be a cRNA population. The reference also can include a "spiked," 
known amount of a particular genomic or cDNA sequence to enable control analysis. 

20 (8) Labeling 

[0062] The labels used can be any suitable non-radioactive marker detectable by any detection method. For example, 
the labels can be fluorescent molecules or can be proteins, haptens or enzymes. Also, "mass spec" labels, such as 
different isotopes of tin, can readily be detected after hybridization to the array by laser removal and mass spectrometry 

25 process, such as MALDI (matrix-assisted laser desorption-ionization). See Wu, et al., Analytical Chemistry 66, 1637 
(1994) and Wu, et al., Rapid Communications in Mass Spectrometry, 7, 142 (1993). Preferably the labels are each 
fluorescent markers having sufficient spectral separation to be readily distinguished from each other without need of 
extensive "cross-talk" correction, such as fluorescein, Texas Red and 5-(and 6-)carboxytetramethyl rhodamine. An 
extensive list of fluorescent label compounds useful for attachment to nucleic acids appears in U.S. Patent 5,491 ,224, 

30 "Direct Label Transaminated DNA Probe Compositions for Chromosome Identification and Methods for their Manufac- 
ture," Bittner, et al. Fluorescent compounds suitable for use are available commercially from Molecular Probe (Eugene, 
Oregon). Indirect labels, such as biotin and phycoerythrin, that are fluorescently labeled after hybridization to the array 
by contact with a fluorescent protein, such as avidin labeled with fluorescein, also can be used. 
[0063] The reference population(s) and the tissue nucleic acid populations are labeled in any suitable manner, such 

35 as by end labeling, nick translation or chemical transformation. Preferably, during either the RT or PCR processing, a 
label incorporation step is used to label the resulting cDNA in a desired fluorescent color. The separated chromosomal 
DNA can be labeled using any suitable labeling chemistry, including end-labeling, nick translation and chemical labeling. 
It is preferred to use nick translation to label the chromosomal DNA in a suitable fluorescent color using a fluorescent 
dUTP or dCTP. Manufacture of suitable fluorescently labeled dCTP is disclosed in K. Cruickshank, Anal. Biochemistry, 

40 "Quantitation of Fluorescent Nucleotide Incorporation by Capillary Gel Electrophoresis and Laser Induced Fluorescent 
Detection," (in press), hereinafter referred to as "Cruickshank." Suitable nick translation kits are available commercially. 
[0064] Preferably, for use of total human genomic DNA as the reference population, the labeling is done by a bisulfite- 
catalyzed transamination process as disclosed in U.S. Patent 5,506,350, "Production of Chromosome Region Specific 
DNA Sequences and Transamination," Bittner, et al. Total human genomic DNA labeled by such a process is available 

45 commercially from Vysis, Inc. (Downers Grove, Illinois). 

[0065] The labeling method used preferably results in a label content of each nucleic acid population of about 0.3 to 
about 6.0 mole percent labeled nucleotides when using direct attachment of fluorophores to the nucleic acids. The 
quantities of each labeled tissue nucleic acid and reference nucleic acid to be used are preferably in the range of about 
100 ng to about 1 /*g, preferably about 300 ng to about 425 ng. 

50 

(9) Array Hybridization 

[0066] The tissue and reference nucleic acid populations are hybridized to the array under suitable hybridization 
conditions, i.e., stringency, for a time selected to permit detection of hybridization of single copy genomic sequences. 
55 The hybridization conditions include choice of buffer, denaturant, such as formamide, salt additives and accelerant. 
Hybridization buffers containing formamide and dextran sulfate at specified pH and salt conditions, such as LSI Hybrid- 
ization Buffer (Vysis, Inc.), are available commercially. The buffer will preferably have a pH of about 6.8 to about 7.2, a 
salt content of about 1.5X SSC to about 2.5X SSC, and a formamide content of about 40-50%. Suitable conditions can 
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include a temperature of about 40 to about 80 degrees centigrade for a time sufficient to detect signal over background 
for both genomic and expression of about 1 to about 72 hours, preferably 1 2-24 hours. Hybridization accelerators, such 
as dextran sulfate, can be used if desired. Adequate diffusion of the tissue and nucleic acid populations into contact with 
all target elements is necessary. This can be achieved by simple diffusion, or by accelerating diffusion or overcoming 
5 diffusion limitations using any suitable means including mechanical mixing, such as by rocking, or fluidic diffusion, such 
as by microfluidic pumping of the labeled populations in and out of a hybridization chamber containing the array. The 
post-hybridization wash is preferably at a stringency greater than that of the hybridization. 

[0067] When using an array comprising human genomic DNA target elements, it is also preferable to add to the 
hybridization mix an excess of unlabeled human repeat sequence DNA, such as Cot1 DNA available from Life Tech- 
10 nologies, Inc., to suppress the non-specific signal resulting from hybridization of labeled repeat sequences present in 
the tissue nucleic acid population or in a reference genomic DNA, if used. Use of unlabeled repeat sequence DNA is 
generally in amounts of about 0.02 to about 5.0 ^g per 1 ng of total labeled genomic DNA (both tissue and reference), 
and preferably about 0.1 to 0.5 j*g per 1 ng total labeled genomic DNA. 

[0068] The hybridization can be performed in any suitable apparatus that will maintain the populations in contact with 
15 the array for a suitable time. For example, the labeled populations can be added to the array, covered with a cover slip 
and then incubated in an oven at the preselected temperature. Preferably, a cover slip designed to provide a desired 
hybridization volume between its bottom surface and the top of the array substrate is used. The labeled populations can 
be added to an array contained in a sealed cartridge apparatus, such as disclosed in European Patent Application 0 
695 941 A1, "Method and Apparatus for Packaging a Chip," published 7 February 1996, by microfluidic injection and 
20 circulation. The hybridization also can be carried out in a miniaturized hybridization and assay chip, such as that disclosed 
in PCT Patent Application WO 97/02357, "Integrated Nucleic Acid Diagnostic Device," published 23 January 1 997. Such 
miniaturized chips are referred to as manufactured on a mesoscale, i.e., manufactured having volumes for fluid pathways 
and reaction chambers measured in amounts of 1 0" 8 and 1 0 -9 liters 

[0069] Figures 1(a) through 1(e) show components of a preferred hybridization cartridge. Figure 1(a) displays the first 

25 component, a chromium coated glass "chip" 30 containing the immobilized nucleic acid target elements 31 of the micro- 
array 32. The microarray 32 is preferably located in the center of the chip 30, as shown. In a preferred format, the chip 
is 25.4 mm long x 16.93mm wide x 0.7mm thick; and the microarray covers a 10.5mm long x 6mm wide area. Shown 
in Figure 1(b), the second component is a "probe clip" 33, depicted with two alternate shapes, square and circular, for 
"array window" 34. The probe clip 33 can be made from any suitable material, preferably plastic. The array window 34 

30 is of a clear material, and is located and sized to permit ready imaging of the microarray. The probe clip 33 forms a 
hybridization chamber and fits snuggly over the array as a retainer and protective cover. Preferably, the array window 
34 is 1.27mm in diameter, centrally located in a 25.4mm long x 16.76mm wide probe clip 33. 
[0070] Figures 1(c) and 1(d) are top and side views of the fourth component, a chip holder 36, preferably made of a 
sturdy, injection moldable plastic, such as high-impact polystyrene, which is capable of withstanding necessary hybrid- 

35 ization temperatures without loss of physical stability. The chip holder 36 can be of any desirable dimension for holding 
the chip, and preferably is 25.4mm wide x 76.2mm long x 3.2 mm thick. As shown, near one end, the chip holder 36 
contains a cavity 37, preferably 26mm long x 18.5mm wide x 1.7mm deep, sized to accept the chip 30 bearing the 
microarray 32. The cavity 37 along its length is also slightly wider, preferably 0.5mm on each side, to create an access 
gap 38 to permit easier addition and removal of the probe clip and microscope cover slip. The surface of the cavity 

40 bottom is scored with shallow grooves to facilitate spreading of adhesive or fixative designed to hold the chip in place. 
The chip holder 36 at the end opposite the cavity 37 can be lightly scored across the width of the holder on its upper 
surface to provide a more grippable surface for the user. The chip holder bottom can be grooved to facilitate alignment 
in an array reader. 

[0071] In manufacture of the completed cartridge, a microarray with desired target elements is manufactured as 
45 described above, and is then glued with any suitable adhesive into the bottom of cavity 37. The chip holder 36 bearing 
the array can then be shrink wrapped, and enclosed in a kit with the probe clip 33, a cover slip used in array imaging, 
and any other desirable reagents for labeling or extracting nucleic acids and/or performing the hybridization. To carry 
out the method of the invention, the user applies the hybridization solution comprising an appropriate buffer and the 
labeled nucleic acid populations (reference and tissue) to the surface of the microarray, and places the probe clip 33 on 
so top of the microarray. The completed cartridge is depicted in Figure 1(e). Also shown superimposed in Figure 1(e) is 
the camera field of view 35 for the preferred imaging system of Che (US09/049.798, March 27, 1998, D. Che) . The 
cartridge is then incubated in an oven, with desired humidity control at the desired hybridization temperature for the 
desired time. 

[0072] When the hybridization is completed, the probe clip 33 is removed and the chip washed at a desired stringency, 
55 preferably, in order with 2X SSC at room temperature for 5 minutes, with 2X SSC and 50% formamide at 40° C for 30 
minutes, and 2X SSC at room temperature for 10 minutes, to remove hybridized probe. Gel/Mount (Biomeda, Foster 
City, California) and DAPI is applied to the array and a 18 mm x 18mm glass microscope cover slip is sealed over the 
array, still in holder 36. The covered chip is then imaged to detect the hybridization results. 
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(10) Array Detection 

[0073] After hybridization, the fluorescence presence and intensity for each label color is detected and determined by 
any suitable detector or reader apparatus and method. Laser-based array scanning detectors are known to the art, see 

5 U.S. Patent 5,578,832, "Method and Apparatus for Imaging a Sample on a Device," Trulsen, et al. Optical waveguide 
detection methods for array hybridization also have been disclosed, see U.S. Patent 5,843,651 , "Light Scattering Optical 
Waveguide Method for Detecting Specific Binding Events," D. Stimpson, et at. Preferably, a large field imaging apparatus 
and method, such as disclosed in co-pending, commonly assigned U.S. Patent Application Serial Number 09/049,798, 
"Large-Field Fluorescent Imaging Device," filed March 27, 1998, D. Che, (herein referred to as "Che") is used. 

10 [0074] The large-field fluorescence imaging apparatus of Che uses reflective optics to couple the excitation beam 
generated by a high-power white light source onto the microarray surface to provide a high illumination intensity, and 
combines the high illumination intensity with the high detection efficiency-of an array detector to provide a high image 
acquisition rate. The white light generated by the light source is collimated and filtered with a computer-controlled filter 
to provide the excitation beam. The excitation beam is passed through a field stop to form a well-defined beam pattern 

15 and then projected onto the array surface with a concave mirror. The concave mirror is disposed to image the field stop 
on the sample to define an illumination area which matches the field of view of the imaging optics. The fluorescent light 
generated in the sample is color filtered to reject scattered light of excitation color and imaged by the imaging optics 
onto the array detector to produce a fluorescent image of the sample. 

[0075] The array imaging apparatus and method may employ digital image processing algorithms used in a pro- 
20 grammed computer for data analysis, storage and display of digital image data from the imaging apparatus. Any suitable 
digital image processing, data storage and display software can be used for analysis of the array hybridization results. 
Digital imaging methods are known to those skilled in the art, for example, as disclosed in U.S. Patent 5,665,549, 
"Comparative Genomic Hybridization," Kallionemi, et al., and U.S. Patent 5,830,645. 

[0076] The hybridization images are preferably captured and analyzed by use of a high resolution digital imaging 

25 camera, such as a SenSys 1600 Camera with PSI interface from Photometries (Scottsdale, Arizona), which receives 
the large field image directly from the detection optics. Any other suitable camera can also be used. The raw image data 
captured by the camera is stored in any suitable computer data base or data storage file. The raw image data is processed 
using suitable image analysis algorithms to determine the marker intensity at each target element of the microarray. 
Image analysis algorithms are well known to those skilled in the art, and a package of a large number of such algorithms 

30 js available as IPLab from Scanalytics (Fairfax, Virginia.) 

[0077] Preferably, the image analysis algorithms carry out the following operations, implemented in appropriate com- 
puter software: (i) background correction, as necessary; (ii) array target element or "spot" segmentation for identification 
of individual array elements; (iii) spot grid assignment of a column and row number to each spot; (iv) spot data analysis, 
including verification of validity and presence of artifacts, averaging of data for replicate spots, normalization of data 

35 from all spots, and multi-experiment comparison and analysis; (v) single spot calculations, including the total intensity 
of each fluorescent marker color, the average DAPI counterstain intensity, the mean, mode, median and correlation 
coefficient of the per pixel ratios of fluorescent intensities, and the ratio of total tissue nucleic acid marker intensity to 
reference intensity, termed as the "mass ratio"; (vi) target summary analysis, including the number of valid replicates 
for a spot, the mean and coefficient of variation of the per spot mass ratios and the correlation coefficient of per pixel 

40 ratios across all spots. Preferably, the image analysis used standardizes the mean mass ratio such that the modal value 
is 1.00 using a window-based estimate of the mode. 

[0078] The fluorescent data at each target element can be compared automatically to produce the ratio between any 
desired tissue and reference or between tissues. For example, when using four tissue nucleic acids (primary tumor 
genomic DNA and cDNA and metastasis genomic DNA and cDNA) with two references (total genomic and total cDNA 
45 from normal tissue of the same cell type as the tumor), at least eight different ratios can be calculated (the ratio of each 
reference with each tissue). 

[0079] The image analysis also preferably comprises implementation of criteria set by the individual user for valid 
analyses, including (vii) exclusion of spots with pixels having saturated tissue or reference color channels; (viii) spot size 
and shape criteria for exclusion; and (ix) a "relation coefficient" exclusion for spots with relative coefficient values below 

50 threshold. The array data analysis can also include comparison algorithms to compare data from individual tests to data 
bases containing disease genotypes and phenotypes (i.e. listing of gene expression and chromosome abnormalities for 
particular diseases), which can identify possible diagnosis or choice of therapy based upon individual test results. 
[0080] The image analysis preferably uses computer display and printing algorithms, such as those, for example, 
known to one of skill in the art, for computer monitor display and computer printing. The data display can include "pseudo- 

55 color" images selected by the user for the individual fluorescent colors of the tissue and reference nucleic acids. The 
array data display can be coupled with display of conventional chromosome ideograms to more clearly detail chromosome 
abnormalities and expressed gene abnormalities identified by the method of the invention. See U.S. Patent 5,665,549, 
Figure 9, for an exemplary ideogram. Preferably, the array data is also displayed so that spots excluded from analysis 
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are marked for ready identification by the user. This can be done by displaying that target element in an "error color" or 
with a colored circle around it. 

[0081] In the preferred embodiment, the array reader and software automatically capture four images of each chip, 
specific for: (1 ) the DAPI counterstain (blue), (2) the tissue DNA (green), (3) the tissue cDNA (red), and (4) the reference 

5 DNA (orange). These images are referred to as color planes. However, images for more or different color planes can 
be taken. The image analysis portion of the software preferably uses one of the colors (preferably the DAPI image) to 
identify target elements and their location in the grid. Once all spots are identified the software analyses each pixel under 
each spot for its intensity in each of the remaining color planes. Suitable algorithms are employed to determine the local 
background for each of these color planes, which is then subtracted from the total intensity of each color. The background 

10 corrected intensities can then be averaged for all pixels under a particular target spot or group of spots, and this average 
intensity per pixel (e.g., A for DAPI intensity, B for tissue DNA intensity, C for tissue cDNA intensity and D for reference 
intensity) can be used for various analyses. 

[0082] For example, the intensity A may be used as an indicator of target spot quality, since the intensity of DAPI 
staining is a function of total amount of DNA attached at the target spot. Below a certain value for A (under controlled 
15 staining conditions) the amount of target element DNA may become rate limiting. The intensity D of the reference DNA 
can be used as an indicator for the efficiency of hybridization, since this reagent is preferably provided in a pre-determined 
concentration and is quality controlled. 

[0083] In the preferred analysis, the most important information is the ratio of background corrected tissue intensity 
over background corrected reference intensity; i.e. for the above example the ratios of B/D and C/D. If more than one 

20 reference is used, then additional ratios can be taken to give informative data. These ratios can be determined for a 
group of spots, a single spot, or for each pixel under each spot. In the most preferred mode, and for the example listed 
above, the B/D and C/D intensity ratios are being determined for each pixel, which should be independent on their 
absolute intensity in any of the colors. In other words, a plot of B versus D, for example, for each pixel under each spot 
should yield a scatter around a straight line, which should intersect both the X and Y axis at 0, if the background correction 

25 was appropriate. (Appropriate algorithms can generate such a plot by "clicking" on a given target spot or group of spots 
in the display.) This plot reveals two types of information: 

[0084] First, the amount of scatter around the linear regression line is indicative of the quality of the data, and can be 
statistically evaluated to generate a correlation coefficient, which for ideal spots is 1 (i.e. all pixel values fall on the 
regression line). A value less than 1 indicates less than perfect data, and a value of 0.8 or less is preferably taken as 
30 an indicator that data from such a spot should be considered suspect. This scatter plot can be generated for a single 
spot or group of spots. Second, the slope of this regression line is the B/D or C/D intensity ratio, respectively, for a given 
spot or group of spots. 

[0085] In order to extract the desired biological information, the B/D or C/D ratio is preferably normalized with respect 
to a control spot or group of spots, for which these ratios can be correlated to a known level of DNA or RNA sequence 

35 jn the test probe mixture. This is done as follows: 

[0086] For analysis of genomic DNA the assumption is made that most of the tissue DNA sequences are in fact present 
in their normal copy number, i.e. two per genome (except for sequences from the sex chromosomes if the test tissue is 
from a male donor). For the reference DNA this is assumed to be true for all sequences (other than those from X or Y 
chromosomes if the reference DNA is from a male donor). Based on these assumptions the software compares the B/D 

40 or C/D ratios of all target spots and selects a group of ratios that appear to be very similar. This group of ratios is assumed 
to represent targets that are normal in the test tissue, and the average of that ratio is used to normalize all other ratios. 
In other words, the B/D or C/D ratios of all spots will be divided by the average B/D or C/D ratio, respectively, of this 
"normal group." Thus, the B/D or C/D ratios of all normal spots should be close to 1, while the B/D or C/D ratios from 
targets that are aneuploid (present in copy numbers larger or smaller than 2), will be around 0.5 or less (deletions) or 

45 1 .5 or above (additions or amplifications). 

[0087] The inventive combination of simultaneous expression and genomic analysis allows a correlation of the ex- 
pression level to the gene copy number, by using the ratios described above as follows: 

[0088] Assume that an assay was performed in which B is the intensity for the tissue genomic DNA, C is the intensity 
for tissue mRNA (cDNA) and D is the intensity for the reference genomic DNA. Then, the ratios to be obtained are as 
so follows: 



(B/D) = background corrected average pixel intensity ratio 

(Bg/Dg) = background corrected average pixel intensity ratio average for "normal" subgroup 

(B/D)/(Bg/Dg) = normalized B/D ratio = Bn/Dn 

55 (C/D) = background corrected average pixel intensity ratio 

(Cg/Dg) = background corrected average pixel intensity ratio average for 

"normal" subgroup 

(C/D)/(Cg/Dg) = normalized C/D ratio = Cn/Dn 
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[0089] The Bn/Dn ratio reveals the number of genomic copies of a given target sequence, the Cn/Dn ratio reveals the 
relative number of mRNA copies per genomic sequence, and the Cn/Bn ratio would indicate whether the relative mRNA 
copy number correlates with a relative change in the genomic copy number change. 

5 (11) Example Arrays 

[0090] Exemplary of the types of microarrays useful in the method of the invention is a prenatal array of about 100 
target elements without replicates, which comprise genomic DNA sequences from (a) the unique sequence regions 
immediately adjacent the repeat sequence regions of (i) all human telomeres and (ii) all human centromeres (taken from 
10 both p and q arm); (b) the "microdeletion" syndrome regions for DiGeorge, Smith-Magenis, Downs, Williams, Velocar- 
diofacial, Alagille, Miller-Dieker, Wolf-Hirschhorn, Cri du Chat, Cat Eye, Langer-Giedion, Kallmann and Prader-Willi/ 
Angelman syndromes; and (c) deletion regions identified with sterylsulfatase deficiency, muscular dystrophy and male 
infertility, and those believed tied to mental retardation that involve deletion of the sub-telomeric, unique sequence 
regions on each chromosome. 

15 [0091] Table 1 lists human genomic DNA clones useful in such an array. This prenatal array has powerful medical 
utility because of its capability to reliably detect multiple gross chromosomal changes causing inherited disease. The 
human prenatal array is also useful for post-natal testing, for fetal cell testing and for pre-implantation genetic testing 
on blastomeres and polar bodies. Table 1 includes the chromosomal loci and the disease correlated to each loci. 

20 TABLE 1 



45 



Prenatal Chip-Loci To Detect Copy Number Abnormalities in Non-Cancer Genetic Diseases 


Gene or Chrom. Locus 


Cyto. Loc. 


Disease 


1p tel 


1p tel 


Mental Retardation, other 


p58 


1p36 


1p36 deletion syndrome 


1 near can 




aneusomy & region marker 


1q tel 


1 q tel 


Mental Retardation, other 


2p tel 


2 p tel 


Mental Retardation, other 


2 ner can 




aneusomy & region marker 


2q tel 


2 q tel 


Mental Retardation, other 


3p tel 


3 p tel 


Mental Retardation, other 


3 near cen 




aneusomy & region marker 


3q tel 


3 q tel 


Mental Retardation, other 


4p tel 


4 p tel 


Mental Retardation, other 


WHSCR/WHSC 


4p16.3 


Wolf-Hirschhorn syndrome 


4 near cen 




aneusomy & region marker 


4q tel 


4 q tel 


Mental Retardation, other 


D5S23 


5p15.2 


Cri du chat syndrome 


5p tel 


5ptel 


Mental Retardation, other 


5 near cen 




aneusomy & region marker 


5q tel 


5 q tel 


Mental Retardation, other 


6p tel 


6 ptel 


Mental Retardation, other 


6 near cen 




aneusomy & region marker 


6q tel 


6 q tel 


Mental Retardation, other 


7p tel 


7 ptel 


Mental Retardation, other 


7 near cen 




aneusomy & region marker 


7q tel 


7qtel 


Mental Retardation, other 
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(continued) 





Prenatal Chip-Loci To Detect Copy Number Abnormalities in Non-Cancer Genetic Diseases 




U6M6 \Ji \slllVJIII. 


f!\/to 1 op 


Dicpacp 


5 


Fla^tin 


7 q 1 1 .23 


Williams svnHromp 




8p tel 


8 p tel 


Mpntal RptarHatinn nthpr 




ft npar ron 




anpusnmv R\ roninn markor 

aiicuouuiy lx icyiuii iiidirvci 


10 


3q te| 


8 q tel 


Mpntal RptarHatinn nthpr 




FXT1 


7 n°4 1 


1 annor-f^ioHinn cv/nHmmo 
l_al lycl -OlcUIUI 1 oyllUIUIIIC 




Qn tot 

5?p iei 


Q n tol 


Montal RotarHatinn nthor 
1 VI cl Hal r\Clai UdUUI 1, UlliCl 


15 


Q nsar ron 
3 I leal uci i 




a no i icnmu JL ran'trsn markor 
diicuouiiiy 01 icyiuii iiidirvei 


Qn tpl 

\7Lj ICI 


Q n tpl 


Montal RptarHatinn nthor 
iviciudi rvcidi udiiui i, uiiici 




1 Up ICI 


10 n tpl 

1 U U ICI 


Montal RptarHatinn nthor 
iviciudi rvcidi Udiiui I, uiiici 




1 n npar rpn 
i u i icdi uci i 




flnPiiQnmv/ A rpninn markor 
diicuouiiiy a ionium nidiivci 


on 


10n tpl 

1 Ulj ICI 


1 UU, ICI 


Mpntal RptarHatinn nthpr 

IVICI Ileal rvcidi UdUUI 1, UIIICI 




vv rojtj 


1 UU 1 H~U 1 O 


V/olnmrHinfarial/nir^ionrno cunHrnmoc 
v ClUUdi uiUidUidi/LJiucui yc oyi IUI Ul I ico 




1 1 n tpl 
I I p lei 


11 n tpl 


Montal RotarHatinn nthor 
IVIGIIldl rVCldl UdUUI 1, UIIICI 


25 


1 1 near pnn 
I I Ileal CUM 




anpiiQnmu JL re*nir\n marker 
diicuouiiiy cx icyiuii nidiivci 


1 1n tpl 

1 1 ICI 


11 fl tpl 


Montal RotarHatinn nthor 
iviciudi rvcidi udiiui i , uiiici 




12p tel 


1 2 p tel 


Montal RptarHatinn nthpr 
iviciudi rvcidi udiiui 1 1 uiiici 




1 9 npar rpn 

1 ^ I ICO 1 l/Cl 1 




anoiiQnmv & rt^n'mn markor 
diicuouiiiy ot icyiuii iiidirvei 


30 


1 9n tpt 
i t-H ICI 


12 o tpl 

\ £- \\ ICI 


Montal RotarHatinn nthor 
iviciudi rvcidi udiiui i, uiiici 




1 npar ron 
I «J 1 icdi UC1 1 




rhrnmnQnmp nniHw fl. ronip>n markor 
ui ii ui i luouiiic puiuy ot icyiuii nidiisci 




1 In tpl 


11 fl tpl 
1 O \\ ICI 


h^ontal RotarHatinn nthor 
iviciudi r\cidi udiiui t, uiiici 


35 


RR1 


11 n14 
i o q in 


Tricr»m\# 1 "X nthor 

i risomy io, oincr 


A An tol 

i*tq iei 


14 n tol 

i*t q iei 


ivieniai r\eiaroaiion f oiner 




i t near cen 




uiii uiTiUbuiTic puiuy ot r cyioii riidr Kci 




i oq iei 


1 c n tol 

I u q ici 


iviciudi rxcidr udiiori, oiner 


40 


i o near cen 








QMPDM 


i o q i i -q i u 


r^raaer-vviiii/Angeiman synaromes 




U I Do I U 


1 ^ n1 1 nil 

i o q i i -q i o 


nraaer-vviiii/Mngeirrian synurornes 


45 


1 fin tol 

i op lei 


1fi n tol 

id p iei 


hA antal RotarHatinn nthar 

Menial rscidraauon, oiner 


16 near cen 




aneusomy & region marKer 




1 fin tol 

i oq lei 


id q iei 


ivieniai tAciaroauon, oiner 




1 / p iei 


t / p iei 


ivieniai r\eiaraaiion f omer 


50 


flu 


17 p11 


Smith- Mag en is syndrome 




PMP22 or adjac 


17 p12 


CMT1A/HNNPP 




D17S258 


17 p13 


Miller-Dieker syndrome/Isolated Lissencephally 


55 


LIS1 


17 p13 


Miller-Dieker syndrome/Isolated Lissencephally 


17 near cen - 


17 p13 


aneusomy & region marker 




17q tel 


17 q tel 


Mental Retardation, other 
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(continued) 





Prenatal Chip-Loci To Detect Copy Number Abnormalities in Non-Cancer Genetic Diseases 




Gene or Chrom. Locus 


Cyto. Loc. 


Disease 


5 


18 near cen 




aneusomy & region marker 




18p tel 


18 p tel 


Mental Retardation, other 




18q tel 


1 8 q tel 


Mental Retardation other 


10 


18d1 1 3 orobe 


18 q1 1 .3 


Tri/lso Chromosome 18p 




19p tel 


1 9 p tel 


Mental Retardation other 




1 Q near can 




aneusomv & reaion marker 


15 


19q tel 


1 9 q tel 


Mental Retardation, other 


20p tel 


20 p tel 


Mental Retardation other 

IVICI 1 Lul 1 V*u UV/I 1 , vU Ivl 




JAG1 


20 p 11 


Alagiile syndrome 




20 near cen 

1 IwUI wwl 1 




aneusomv & reaion marker 


20 


20q tel 


20 q tel 


Mental Retardation other 

IV Ivl I1MI 1 ivlO 1 VI CI 11 V 1 1 1 vll Iwl 




21 q tel 


21 q tel 


Mental Retardation, other 




21 near cen 




aneusomv & reaion marker 

Ml IvUvwM IJf i vyiwi 1 M lUl l\wl 


25 


MNB or D21 S55 


21 q22.1 


Down syndrome 


ERG 


21 q22.1 


Down dvndrome 

L/virl 1 UVIIUIwlllw 




22q tel 


22 q tel 


Mental Retardation other 




22a near cen 




Cat Eve svndrome 


30 


GSCL 


22q11 


Veloeardiofacial/DiGeorae <;vndromp<? 




HIRA TLJPI F 1 


22q11 


V/plnrardinfapial/nif^pnrnA QunHrnmpQ 




XIY n tel 

Al 1 yJ ICI 


X/Y n tpl 


Mental Retardation other 
iviuiiiai rscicii uauwi i, uliici 


35 


O 1 o 


X n22 1 


ioi m ly uoio, a in ifscu 


KAI 
r\rAi_ 


X P22.3 


Kallmann QvnHrnmp 
rxaiii i lai n i oyiiuiuiiic 




AR 


Xq11-q12 


aneusomy & region marker 




XIST 


Xq13.2 


Region marker 


40 


Dystrophin exon 


Xp21 


Muscular Dystrophy 




X/Yqtel 


X/Yqtel 


Mental Retardation, other 




SRY 


Yp11.3 


xx males, etc. 


45 


AZFB 


Yq11.2 


male infertility/Yq marker 


AZFC 


Yq12 


male infertility/Yq marker 



[0092] Another example is the AmpliOnc™ genomic DNA target element array containing genomic sequences for 
each of the 52 oncogene or amplified gene loci listed in Table 2. 

50 

TABLE 2 



AmpliOnc Loci 


Gene or Chrom. Locus 


Cyto. Location 


Cancer Association 


NRAS 


1p13.2 


Breast cell line 


MYCL1 


1 p34.3 


Small cell lung cancer cell line, neuroblastoma cell line 
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(continued) 





AmpliOnc Loci 




Hano nr Phrnm 1 nrtic 
OclIC Ul VslllUIII. LUIU3 


vyiu. L.UUdllvJil 




5 


FfiR 


1 n^6 2-n36 1 






1 AMP? 


1 L]*lvJ-t_]0 1 


Dicaol ucit line 




RFI 


jlfj 1 O [J 1 ^ 


iNuii*nuuyi\ni o i_yiiipi lunid 


10 


Al K 




ly f 1 1 \j\ i id 




MYPN fKI-m\/p\ 

IVI T IN ^IN'lliyO^ 


?n24 3-n24 1 


Npi irnhlaQtnma 

INCUI UUIdOlUI 1 let 




RAF1 


^n25 


Nlnn-Qmall ppII Innn ranrpr 

INUII Oilldll OCII lUII^ UCII ll/CI 


15 






Pprwiral Maari A Mprk 1 unn 
v^C/i viudi, ncau ex iNcv^rv, l. uiiy 




OLjtO .O 


Piwarian 
Uval Idi l 




Dpi fi 




lumnhnmfl 

i y 1 1 1 pi \\jf i la 




prjf^FRA 
ruurnn 


4n1 1-n12 


P^ilnhlaQtnma 


on 


MYR 

IVI 1 o 


6a22 


Pnlnrprtal" 1 pukpmijv Mplannma. 




FSR1 fFR FSR^ 


6a25 1 


Rrpact 




EGFR (ERBB1, ERBB) 


7p12.3-p12.1 


Glioma; Head & Neck 


25 


PGY1, MDR1 


7q21 


Drug resistant cell lines 


MET 


7q31 


Gastric 




FGFR1, FLG 


8p11.2-p11.1 


Breast 




MOS 


8q11 


Breast - 


30 


ETO, MTG8, CBFA2T1 


8q22 


leukemia 




MYC (c-myc) 


8q24.12-q24.13 


Small cell lung, Breast, Esophageal, Cervical, Ovarian, Head & 

KJprk ptr 




ARI 1 /ARI ^ 


Qn34 1 

v/^OH. 1 


PMI 


35 






Breast 




UDAC 

nr\MO 




PpJpirpptal RlartHpr 






i i q i o 


ncdU Ol INcUK, Cbupildycdl, Dicdal, ncpdllt, Wvdlldii 


40 


ror*f (no I rl, Flo 1 ) 


i iq io 


Dicdoi, vyvaridn 




rorrO (IIN 1 c. ) 


i iq i j 


Rmact Piwarian r^octrif* Molonnma WqqH J2. Mori/ 
Dlcdbl, WvdildH, odbulO, IVIcldl lUIIld , ntJdU <X INCUR 




CMC1 

cMol 


i nqno 


Dieasi, Diaaaer 




P2APD/ni 1 QA^PN 


i i q i o. o-vj i 


Breast 


45 




i iq i o.o-q i*t 


Breast 




Ml I /A| | 1\ 
MLL (ALL I ) 


i iq^o 


leuKemid 






I Zp I I 


Prt Irtror^tal PiQefrir* AHonnmrt ir>isl 1 i inn niont r*o 1 1 

ouiorecidi, odsinc, Aaenocor ucdi, Lung gidni ceil 


50 


CCND2 (Cyclin D2) 


12p13 


Lymphoma, CLL 




TEL (ETV6) 


12p13 


leukemia 




WNT1 (INT1) 


12q12-q13 


Retinoblastoma 




SAS; CDK4 


12q 13-q14 


Sarcoma, glioma 


55 


GL1 


12q 13.2-q 13.3 


Sarcoma, glioma 




MDM2 


12q 14.3-q 15 


Sarcoma, glioma 
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(continued) 





AmpliOnc Loci 




vjciiu ui milium. Lui/Uo 




wancci Mssociauon 


5 


AKT1 

r\T\ 1 1 




Odolllu 




PML 


15Q22 


ICUftcl 1 1 Id 






15n95-n9fi 


idl c dllipilUUi 1 


10 


FES 


15q26.1 


idle diiipuoun 




MRP 


1Rn n 1 


nn in rocictant roll linoc 
LJIUy lC?ololdlll IllltJo 




MYH1 1 
ivi t n i i 


1fin1^ 1^-n1^ 19 
I op I O. I o-p I O. I £ 


lOUnCiiild 


15 


PRFR 


1fin99 


lot iLomiQ 


RARA 


17q12 


Ipi i Wpmia 

ICUrVCI 1 lid 




HFR-9/nPii (FC^FR9\ 


17n 19-91 


Rroact Ov/arian f^actrif* 

Dicdbi, wvdridn, odoinu 




TOP9A 


17n91-n99 






T CO I 


i op I I .0 


odSinc 




uou-o segment 


1Rn91 ^ 
I 0CJ<1 1 .0 


iNon-noagKin s Lympnoma 




DULL v) ocyilltJIK 


1ftn91 ^ 
I 0\\l. 1 .0 


iNon-nougKin s Lyrnpnoma 


25 


IKIwD /inn ill rA/^Ar\^Ar\ 

iiNor\ unsuun receptor; 


1Qrk11 9 


Breast 


II (MR 




HeLa cell lines 






t yq i z 


Gastric, Ovarian 




RPI ^ 


i yq i o 


lyrnpnorna 


30 


AIR1 
MID I 


9nn19 

ziuq \£. 


Breast 






9nn1^ 
<cuq io 


Breast 




^ylvnl o 

IVI T 


9nri 1Q 1 

zuq i o. i 


Breast 


35 


PTPNJ1 

r 1 rlN 1 


zuq i o. i -q i 


Breast 


7MF917 /7AQP1\ 
t-INr^ I r ^£.r\Ow I } 


zuq i o.£ 


Breast 




STK15 (BTAK, aurora 2) 


20q13.2 


Breast, ovarian, colon, prostate, neuroblatoma and cervical 




AML1 (CBFA2) 


21q22.3 


leukemia 


40 


BCR 


22q11.21 


leukemia 




EWSR1 (EWS) 


22q12 


sarcoma 




PDGFB (SIS) 


22q 12.3-q13.1 


Rhabdomyosarcoma, liposarcoma 


45 


AR 


Xq11.2-q12 


Prostate 


Note: Alternate names for a gene are shown in parentheses. 



[0093] Genomic DNA target elements derived from the clones listed in Table 2 contain human genomic DNA inserts 
of about 50 kb to about 200 kb in a PAC, P1 or BAG vector. This array is produced without separation of the vector 
sequences. Use of this array permits simultaneous identification of genomic amplification of each of these oncogene 
loci, as well as expression of the genes which map into these regions. 

[0094] Yet another example is an AmpliOnc II array, which contains genomic DNA from the oncogene loci of Table 
2, supplemented by genomic DNA from the human tumor suppressor gene loci for: the p53, RB1 , WT1 , APC, NF1 , NF2, 
VHL, MEN1, MENZA, DPC4, MSH2, MCH1, PMS1, PMS2, P57/KIP2, PTCH, BRCA1, BRCA2, P16/CDKN2, EXT1, 
EXT2, PTEN/MMAC1, ATM, and TP73 genes. The genomic DNA target elements are produced by selecting genomic 
DNA clones from a human genomic library that map to the loci for these tumor suppressor genes. This selection is done 
by the preparation of PCR primer pairs from the loci or genes and subsequent library screening to identify the clones. 
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In this embodiment, the clones for the tumor suppressor loci can be about 20 kb to 250 kb, and are preferably about 50 
kb to about 200 kb in complexity. 

(12) Utility of the Invention 

5 

[0095] The methods of the invention have significant utility in the fields of genetic research, human disease manage- 
ment, human disease clinical research, human disease drug development and pharmacogenomics, human genetic 
research, animal drug development, animal disease management, animal genetic research, and plant genetic research. 
In particular, by enabling more precise genetic detailing of suspected cancerous tissue, the invention will provide improved 
10 disease management through more tailored diagnosis and therapy selection. The methods can also be used to determine 
the presence of viruses, viral integration into chromosomes and expression of viral genes. The method can also be used 
to simultaneously detect human genomic DNA abnormalities, human gene expression and gene expression of bacterial 
genes. 

[0096] The methods of the invention are particularly useful for genomic disease management of cancer and other 
is disease. For example, the methods are useful for categorizing genotype and phenotype of cancer, including those of 
the breast, prostate, lung (small cell and non-small cell), ovary, cervix, kidney, head and neck, pancreas, stomach, brain, 
soft tissue and skin, and of various btood or lymphatic system cancers such as leukemias and lymphomas. Once the 
tumor tissue genotype and phenotype are categorized by the method of the invention, the physician can combine this 
data with other clinical data to determine diagnosis, prognosis, therapy and predict response to therapy. 
20 [0097] The capabilities provided by the multi-color methods of the invention enable rapid comparative testing in drug 
development. For example, a cancer cell line can be dosed with a putative drug compound and at desired time intervals 
thereafter a cell sample cay be removed. Each of the removed cell samples, for example, collected at time 0, 10, 20 
and 30 hours after dosing, is treated to extract nucleic acids, which are then each labeled with a separate fluor. The four 
populations are then applied to the array with appropriate reference. The time-tracked effects of the drug on expression 
25 and initial chromosome status are thus assessed. Chromosomal change generally occurs over longer time periods and 
is not expected to change in this example. The method also can be applied to assess drug efficacy in drug resistant cell 
lines, particularly as drug resistance can be caused by gene amplification. 

EXAMPLES 

30 

[0098] The following examples are intended to be merely {Illustrative of the invention and are not to be construed as 
limiting. 

Example 1 

35 

(A) Procedures 
[0099] 

(i) Test array manufacture : Four inch x four inch chromium-coated plates (Nanofilm) were scored by U.S. Precision 

to Glass Company (Elgin, Illinois), and the scoring marked 24 equally sized chips. A 180 target element microarray was 
made on each chip. Before nucleic acid deposition, the plate was washed consecutively with distilled water, isopropanol, 
methanol and distilled water, allowed to dry and equilibrated to room temperature. The microarray was deposited centrally 
in each chip and occupied about 5 mm x 6 mm of chip surface. The microarray was made using a computer-controlled, 
single needle fluid deposition robot supplied by New Precision Technologies (Northbrook, Illinois). The robot was modified 

45 by addition of a laser-based Z-axis controller, a pressure regulatable nitrogen gas line hooked to the deposition pin and 
a platen sized to hold twelve, 4" x 4" plates. The robot used multiple deposition pins, each a 33 gauge, one-inch long 
steel capillary syringe needle linked to a Luer lock syringe tip from EFD. The capillary pins were each loaded with a 
different genomic DNA by loading into the Luer lock portion of the needle. The needle was changed manually after 
deposition of each target element on all chips on the platen. The microarray was made with approximately 400 micron 

50 spacing between target element centers in both the X and Y directions. 

The robot was controlled with computer software provided with the robot, which was modified to bring the capillary pin 
into contact with the chip surface and, at the contact moment, to apply a microburst of nitrogen pressure to the top of 
the pin. The contact and microburst period was about 10 milliseconds per target element. The gas pressure was about 
1 psi and was regulated manually, as necessary, to force sufficient amounts of the viscous genomic DNA out of the pin. 

55 The control conditions were set to deposit about 0.3 nl of 1 \ig/\i\ nucleic acid in 100 mm NaOH per spot. The deposited 
elements were approximately round, with variations noticeable under microscope examination after DAPI staining. The 
spot size also varied with the viscosity of the DNA. Individual chips were separated manually. 
The microarray comprised spots with genomic DNA from 31 human putative amplified gene loci, one spot of total human 
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genomic DNA, three control spots of pooled genomic DNA, each spot a pool of equal amounts of genomic DNA for ten 
of these oncogene loci, and one spot of lambda phage DNA. These thirty-six spots were replicated five times each on 
the microarray to produce the one hundred-eighty spot microarray. The 31 human putative amplified gene loci are listed 
below, and were genomic human DNA inserted into BAC, PAC or PI cloning vectors. Each of the genomic DNA for these 
5 loci was produced with DNA of a single BAC, PAC or PI clone, although the individual insert sizes were not uniform. 
These BAC clones were obtained by screening the available genomic libraries with a primer sequence for each locus, 
as follows: 



10 


GENE LOCUS 


CLONE NO. 


LIBRARY SOURCE 1 


MYCL1 


RMC01P052 


UCSF 




FGR 


RMC01P057 


UCSF 




REL 


BAC-274-P9 


GS 


15 


N-MYC 


PAC-254-N16 


GS 




RAF1 


BAC-98-L2 


GS 




PIK3CA 


PAC-97-B16 


GS 


20 


PDGFRA 


BAC-619-M20 


GS 


MYB 


BAC-268-N4 


GS 




EGFR 


BAC-246-M20 


GS 




MET 


BAC-54-J7 


RG 


25 


FLG 


BAC-566-K20 


GS 




C-MYC 


P1-469 


GS 




ABL 


PAC-763-A4 


RG 




BEK 


BAC-126-B28 


GS 


HRAS1 


BAC-137-C7 


GS 




BCL1 


PAC-128-18 


GS 




INT2 


BAC-36-F1 6 


GS 


35 


KRAS 


BAC-490-C21 


GS 




WNT1 


BAC-400-H17 


GS 




GLI 


RMC12POOt 


UCSF 


40 


CDK4 


BAC-561-N1 


GS 




MDM2 


BAC-82-N 15 


GS 




AKT1 


BAC-466-A19 


GS 




FES 


P1 1-2298 


GS 


45 


HER2 


P1-506 


GS 




YES1 


BAC-8-P19 


GS 




JUNB 


BAC-104-C10 


GS 


50 


20q13.2 


BAC-97 


GS 




PDGFB 


RMC22P003 


UCSF 



55 
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(continued) 



GENE LOCUS 



w 



AR 



CLONE NO. 



PAC-1097-P11 



LIBRARY SOURCE 1 



RG 



1 GS is Genome systems; RG is Research Genetics; UCSF 
is the LBL/UCSF Resource for Molecular Cytogenetics, 
University of California, San Francisco, Cancer Center. The 
clone number for each locus is shown. Human insert sizes 
ranged from about 60 kb to about 212 kb; not all inserts 
were measured. Chromosome location for each is in Table 
2 above. 



15 (ii) Tissue extractions and labeling: For each of SJSA-1 and Colo 320 cell lines, obtained from ATCC, the cells were 
centrifuged at 7,000 rpm at 4° C to produce cell pellets. Supernatant was discarded. The pellets were resuspended in 
Solution #2 of DNA Extraction Kit from Stratagene. The pellets were homogenized using a mechanical homogenizer 
at medium setting. Pronase was added to produce a pronase concentration of 100 ng/ml in each tube. Tubes were 
incubated with shaking at 60° C for one hour. Tubes were placed on ice for 10 minutes. Stratagene DNA Extraction Kit 

20 Solution #3 was added and the tubes again placed on ice for 5 minutes. Tubes were centrifuged for 15 minutes at 
8,000 rpm at 4° C to pellet the protein precipitate. The supernatant was decanted. RNase was added to the supernatant 
to produce an Rnase concentration of 20 jig/ml and the supernatant incubated at 37° C for 15 minutes. Two times the 
volume of ethanol was added and then centrifuged for 1 5 minutes at 1 0,000 rpm. Supernatant was decanted. The DNA 
pellets were dried under vacuum with a Speed Vac. The DNA pellets were resuspended in water and 995 jxl of 50 mM 

25 sodium hydroxide added. 

Cy-5 dUTP, from Amersham (Arlington Heights, Illinois) and a fluorescein labeled dCTP, produced according to Cruick- 
shank, was used in nick translation to label the extracted DNA. The nick translation of Cy-5 dUPT for SJSA-1 incorpo- 
ration used a standard protocol with a Promega (Madison, Wisconsin) nick translation kit. For Colo 320, 10 til of nick 
translation enzyme and 5 |xl of nick translation buffer (both from Vysis, Inc.) were mixed with 1 jig of extracted Colo 

30 320 DNA, 4 p,! each of dATP, dGTP and dTTP, 1 jd of dCTP, 2 jjlI of fluorescein dCTP, produced according to Cruick- 
shank, and sufficient water to produce 50 of solution. The mix was incubated at 37° C for 30 minutes. The enzyme 
was heat inactivated by heating at 80° C for 10 minutes. The solution was G-25 Spin Column purified and the labeled 
probe dried with Speed Vac. for 40 minutes. 

(ii') Hybridization : The nick translated DNA's (415 ng each), reference DNA (415 ng SpectrumOrange Total Human 
35 DNA (Vysis, Inc.), and Cot-1 DNA (100 jig), (LTI, Bethesda, Maryland) were mixed with about 15 \i\ LSI Hybridization 
Buffer, (Vysis, Inc.), to produce 25 jil of hybridization mix. The hybridization mix was pipetted onto the chip contained 
in a chip holder shown in Figure 1. The chip was glued in place in the holder using RTV 103 silicone rubber sealant 
(GE, Waterford, New York). The probe clip 33 of Figure 1 was applied as described above. The holder was then incu- 
bated at 37° C overnight in an enclosed moisture chamber. After hybridization, the probe clip was removed and the 
40 chip washed with 2X SSC at room temperature for 5 minutes, the 2X SSC and 50% formamide at 40° C for 30 minutes, 
and then 2X SSC at room temperature for 10 minutes. The washed chips were dried at room temperature in the dark. 
Ten p.l of GEL/Mount™ and DAPI were added and an 18 mm x 18 mm glass cover clip was placed over the array in 
the holder. 

(iv) Image Capturing and Analysis: A bread-board imaging apparatus of Che was used to capture large field images 
45 of the hybridized array through the array window, without removal of the probe clip or cover slip. The bread-board 
image included a dual filter wheel (Ludl) and single band pass filters (Chroma Technology, Battleboro, Vermont) for 
each of DAPI, fluorescein, SpectrumOrange and Cy5 were used for excitation and emission. Image data was processed 
using a Macintosh computer running algorithms that carried out the following steps: (1) Each target element spot is 
located from the DAPI image and assigned its grid location; (2) fluorescent intensities for each fluor at each spot are 
50 determined; (3) fluorescent ratios, by mode, median and mass, are calculated for each spot; (4) exclusion criteria based 
on spot size and intensity threshold; (5) composite images are produced and displayed on a computer monitor; (6) 
displayed images include white circles drawn around each spot and number of grid location; (7) printing capability for 
conventional computer-based printers; and (8) raw and processed data and image storage. (B) Results 
[0100] The fluorescence ratio for the Colo 320 compared to reference is shown in Table 3. As Table 3 indicates, the 
55 oncogene CMYC was amplified 32 fold in the Colo 320 cells. This compares to the known amplification of CMYC in Colo 
320 of 29 ± 6 fold (calculated from average of published data). A pseudo-colored composite image of the hybridization 
results showed significant color intensity for the CMYC elements, which also indicated amplification of the CMYC locus. 
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Table 4 shows the fluorescent ratio analysis results for the SJSA-1 cells compared to reference. Table 4 shows the GLI 
(9.4 fold), MDM2 (7.5 fold) and CDK4/SAS (12.1 fold) loci are each amplified in SJSA-1 cells. A pseudo-colored composite 
image of the hybridization results showed significant color intensity for the GLI, MDM2 and CDK4/SAS elements, also 
indicating amplification. Table 5 shows the fluorescent ratio of the Colo 320 signal compared to the SJSA-1 signal for 

5 most targets is around 1. However, the low ratio of the GLI (0.12), MDM2 (0.13) and CDK4/SAS (0.09) indicates these 
gene loci were amplified in SJSA-1 cells relative to the Colo 320 cells. The high ratio of target CMYC (40) indicates the 
CMYC amplification in the Colo 320 cells. The gene amplification observed with three probes (two sample probes and 
one reference probe) hybridized simultaneously to one chip was similar to that obtained by separate hybridizations of 
the SJSA-1 and Colo 320 DNAs onto separate chips. (Subsequent to data collection, it was learned that the clone for 

10 the AKT2 locus was not correctly mapped. The data shown in Tables 3, 4 and 5 and in Figure 2(a) through 2(h) for the 
AKT-2 target element are, thus, not meaningful.) 

[0101] This Example 1 is the first demonstration known to the applicants of a comparative hybridization of more than 
two separately-labeled nucleic acid populations to the same array. These results demonstrate the simultaneous hybrid- 
ization of three separately-labeled nucleic acid populations to a microarray to detect status of tissue nucleic acids. 

15 

Table 3. Test/Reference ratio analysis for the hybridization results of Example 1. CMYC amplification in Colo 320 

cells was observed. 
Norm. Ratio: (by mode) (by median) (by mass) 
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Table 4. Test/Reference ratio analysis for the hybridization results of Example 1. GLI, MDM2 and CDK4/SAS 

amplification in SJSA-1 cells was observed. 
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Table 5 . Test/Reference ratio analysis for the hybridization results of Example 1. GLI, MDM2 and CDK4/SAS 
amplification in SJSA-1 cells and CMYC amplification in Colo 320 cells were observed. 
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(1.00 


2%) 
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20 


MYCL1 


5 
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21 
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(0.98 
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0.949 
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7%) 


(1.04 
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0.948 


23 
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0.926 


24 
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(1,05 


4%) 
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26 
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5 


(0.91 
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1%) 
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3%) 


0.959 


27 
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5 


(1.15 


9%) 


(1.18 


2%) 


(1.20 


2%) 


0.906 


28 


FLG 


5 


(0.85 
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(0.83 


4%) 


(0.84 


4%) 


0.865 
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5 


(0.97 


5%) 


(0.98 


3%) 


(0.97 


3%) 


0.918 
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AKT1 


5 


(1.21 


696) 


(1.17 


3%) 


(1.15 


2%) 


0.893 


31 
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5 


(0.91 


9%) 


(0.96 


9%) 


(0.96 


7%) 


0.968 


32 


CDK4 


5 


(0.09 


4%) 


(0.11 


9%) 


(0.09 


3%) 


0.960 


33 


A.R 


5 


(1.00 


6%) 


(1.08 


2%) 


(1.12 


1%) 


0.960 


34 


d 


5 


(0.93 


11%) 


(0.99 


5%) 


(0.93 


3%) 


0.824 


35 


c2 


5 


(2.68 


18%) 


(2.40 


8%) 


(1.78 


5%) 


0.939 


36 


c3 


5 


(0.29 


3%) 


(0.31 


5%) 


(0.31 


3%) 


0.966 


All 
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7% 




7% 


0.954 



Normalizer 



0.31 



0.32 



0.30 



Example 2 

(A) Procedures 
[0102] 

(i) Array : The same 180 element microarray of Example 1 was used. 

(ii) Tissue extraction and labeling : Two cell lines were used in this experiment, Colo 320 and K562, both from ATCC. 
Five million cells of each were spun down (1.5K for 10 min.) to pellet. After decanting, 100 RNase solution and 
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300 jil lysis solution were added to the pellet and the mixture were vortexed at high speed briefly. The mRNA for 
each cell line were isolated by nitrocellulose-polyT using the isolation protocol was provided by the manufacturer 
(Ambion, Texas). 

The isolated mRNA was ethanol precipitated and reverse transcribed in the presence of Cy-5-dCTP (Amersham) 
5 using conventional protocol and primered by random pN9 to produce the Cy-5 labeled cDNA probe, of which one- 

fifth was used for each hybridization assay (one million cell for each assay). DNA was isolated for each cell line with 
conventional phenol-chloroform extraction and labeled with nick translation in the presence of fluorescein dCTP as 
in Example 1 to produce the labeled gDIMA. 

(iii) Hybridization : Each hybridization was at total volume of 25 \l\ consisting of 15 p,l LSI hybridization buffer (Vysis, 
10 Inc.), 200 ng cell line gDNA probe, 200 ng cell line cDNA probe, 200 ng SpectrumOrange Total Human Genomic 

DNA (Vysis, Inc.) as the reference, 20 \ig salmon sperm DNA and 40 y.g Cot-I DNA. Hybridization was to microarrays 
in chip holders with probe chip as in Example 1, and was carried out at 42°C in an enclosed moisture chamber for 
three days. For each cell line, the hybridization was duplicated on two chips. The overall process is shown below: 



15 



20 



25 



Cell lines (Colo 320) Ceil lines (K652) 

x x r x 

mRNA DNA mRNA DNA 

rt| |nt rt| |nt 

cDNA-Cy-5 RefDNA-SO DNA-G cDNA-Cy-5 RefDNA-SO DNA-G 

X f / X { / 



30 HYBRIDIZED TO CHIP HYBRIDIZED TO CHIP 

(iv) Imaging capturing and data analysis : Fluorescent images of hybridized chips were taken and analyzed, as in 
Example 1, with the breadboard dual-filter wheel imaging system of Che. Single-band pass filters were used for 
both excitation and emission. Images were analyzed with the same software as in 

35 

Example 1. 
(B) Results 

40 [0103] General description of figures: Data are presented as scatter plots and/or bar graphs. The scatter plots, with 
each point corresponding to a particular target clone, serve as statistical representation of data sets. The information 
for any given-target clone can be extracted from the bar graphs. 

(i) Signal Intensity : The intensities of background corrected signal for the genes in the microarray were comparable 
45 between tissue cDNA (average of 165 counts for 10 seconds exposure) and tissue gDNA (average of 187 counts 

for 10s exposure). Background associated with cDNA detection was higher, 132 counts as compared to 73 counts 
for gDNA. For both cDNA and gDNA, even the weakest signals were well above background (S/B > 1) with 60 
seconds exposure, provided that enough probe was deposited on the chip. 

(ii) Data reliability : Figure 2(a) shows the correlation of genomic DNA hybridization data obtained from two hybrid- 
50 izations for each of the cell lines. Linear regression correlation of the data for Colo 320 and K562 are 0.9963 and 

0.9999, respectively, indicating high reliability of the data. As expected, the ratios of the tissue gDNA over human 
reference gDNA formed a cluster for a majority of the target element genes (around one after normalization). Ratios 
that were distant from the cluster indicate gene amplifications in the cell lines for the corresponding genes (CMYC 
in Colo 320 and ABL in K562). It is interesting to note that for both cell lines that were tested, the "normal" cluster 
55 spans a ratio range from 0.5 to 1 .5. Within this range, the values of the ratio were highly reproducible between 

experiments and they were distributed such that it was believed unreliable to identify any particular gene within this 
cluster as deleted or amplified. 

Figure 2(b) shows the reliability of gene expression hybridization data obtained from two hybridizations for each cell 
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line. Linear regression correlation of the two sets of data for Colo 320 and K562 were 0.9989 and 0.9790, respectively. 

(iii) Assay Multiplexing : Figure 2(c) (for K562 cell line) and Figure 2(d) (for Colo 320 cell line) demonstrate the assay 
multiplexing achieved with the new assay format. With a separate genomic DNA assay, one could detect only the 
genomic copy numbers (relative to human reference) of the target sequences (green bars). With an expression 

5 cDNA assay, one could only detect the expression profile (some equivalence of red bars). With the method of the 

invention, the genomic and expression data were acquired simultaneously. 

(iv) Use of normal human total gDNA as reference for expression assay : 

Normally, because of lack of a "universal" or "normal" reference, the expression levels of two samples can be 
10 compared reliably only when the expression assays for the two samples are performed on the same chip in 

separate assays. Example 2 used total normal human gDNA as the reference nucleic acid for expression assay. 
When using the tissue cDNA and reference gDNA labeled with fluorochromes of different color, after hybridi- 
zation, the fluorescent intensity ratio of the two colors should reflect the initial concentration ratio of the cDNA 
and reference gDNA in the probe solution. If a particular reference gDNA is readily available and its copy 
15 numbers of gene specific sequence do not change (i.e., are "stable") or varies only negligibly, then it can be 

used as a universal reference for all expression assays. 

The expression profile can be expressed as the ratio of cDNA over reference gDNA as shown in Figure 2(e). 
This ratio profile is sample and sample only dependent. 

In other words, if two expression assays of the same sample are carried out in two separate hybridization on 
20 two different chips comprising the same array, the expression profiles obtained from the two assays should 

differ only by a scaling factor which is constant for all targets. Different samples will exhibit different expression 
profiles (expressed as ratio to reference genomic DNA). Comparison of Figures 2(b) and 2(e) show that the 
expression profiles are indeed sample and sample only dependent. With the use of total human genomic DNA 
as a reference for expression analysis in the methods of the invention, the expression profiles of different 
25 samples can be compared even if the assays are carries out separately and independently. 

(v.) Correlating genomic amplification to gene over-expression: Figure 2(f) and 2(g) are plots of genomic copy 
number vs cDNA (both relative to reference genomic DNA) for K562 and Colo 320 cell lines, respectively. 

30 [0104] As expected, within a cell line, except for the amplified genes, the expression levels for the rest of the genes 
analyzed varied widely while their genomic copy number maintains relatively constant. As shown in Figure 2(e), in both 
cell lines, for some genes, such as JUNB, HRAS1 , GLI, the cDNAs are more abundant while for others, such as PDGFRA, 
BEK, MDM2, the cDNAs are less abundant. Significantly, for C-MYC and ABL, the expression levels are very different 
for the two cell lines and the trend is in accordance with their amplification at the genomic level. The over-expression of 

35 C-MYC in Colo 320 and ABL in K562 can be attributed to gene amplification. Figure 2(h) is the plot of "gene expression" 
ratio vs "gene copy number" ratio between the two cell lines. Interestingly, there was a remarkable correlation between 
the two quantities. (Linear regression results, Y = 0.262X + 0.724, correlation 0.985). In the graph, genes that are 
unamplified in both cell lines form a cluster, while genes that are unequally amplified in the two cell lines are separated 
apart from the cluster. This graph, or more generally, the simultaneous genomic and expression assay, facilitates reliable 

40 attribution of over-or under-expression to gene amplification or deletion. 



Claims 

45 1. A method for simultaneous detection of gene expression and chromosomal abnormality in a tissue sample com- 
prising: 

(a) providing an array of nucleic acid target elements attached to a solid support wherein the nucleic acid target 
elements comprise polynucleotide sequences substantially complementary under preselected hybridisation 

so conditions to nucleic acids indicative of gene expression and of chromosomal sequence of a tissue sample; 

(b) providing at least three labelled nucleic acid populations: 

(i) a mRNA or cDNA population labelled with a first marker and derived from the tissue sample, 

(ii) a chromosomal DNA population labelled with a second marker and derived from the tissue sample, and 
55 (iii) at least one reference nucleic acid population labelled with a third marker; 

(c) contacting the array with the labelled nucleic acid populations under hybridisation conditions; and 

(d) detecting presence and intensity of each of the first, second and third markers to at least two target elements. 
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2. The method of claim 1 further comprising determining ratios at each target element (i) between the first and third 
markers and (ii) between the second and third markers. 

3. The method of claim 1 or 2 wherein the first, second and third markers each comprise a different fluorescent label. 

5 

4. A method for simultaneous detection of gene expression and chromosomal abnormality in a tissue sample com- 
prising: 

(a) providing an array of nucleic acid target elements attached to a solid support wherein the nucleic acid target 
10 elements comprise polynucleotide sequences substantially complementary under preselected hybridisation 

conditions to nucleic acids indicative of gene expression and of chromosomal sequence of a tissue sample; 

(b) providing at least three labelled nucleic acid populations: 

(i) a mRNA or cDNA population labelled with a first fluorescent colour and derived form the tissue sample, 
15 (jj) a chromosomal DNA population labelled with a second fluorescent colour and derived from the tissue 

sample, and 

(iii) at least one reference nucleic acid population labelled with a third fluorescent colour; 

(c) contacting the array with the labelled nucleic acid populations under hybridisation conditions; and 

20 (d) detecting presence and intensity of each of the first, second and third fluorescent colours to at least two 

target elements. 

5. A method for simultaneous detection of gene expression and chromosomal abnormality in a tissue sample com- 
prising: 

25 

(a) providing an array of nucleic acid target elements comprising genomic DNA attached to a solid support 
wherein the nucleic acid target elements comprise polynucleotide sequences substantially complementary under 
preselected hybridisation conditions to nucleic acids indicative of gene expression and of chromosomal se- 
quence of a tissue sample; 
30 (b) providing at least three labelled nucleic acid populations: 

(i) a mRNA or cDNA population labelled with a first fluorescent colour and derived form the tissue sample, 

(ii) a chromosomal DNA population labelled with a second fluorescent colour and derived from the tissue 
sample, and 

35 (jii) at least one reference nucleic acid population labelled with a third fluorescent colour; 

(c) contacting the array with the labelled nucleic acid populations under hybridisation conditions; and 

(d) detecting presence and intensity of each of the first, second and third fluorescent colours to at least two 
target elements. 

40 

6. The method of any of claims 1 to 4 wherein the target elements comprise genomic DNA. 

7. The method of any of claims 1 to 4 wherein the target elements comprise cDNA. 

45 8. The method of any of claims 1 to 4 wherein the array comprises cDNA and genomic DNA target elements. 

9. The method of any of claims 1 to 8 wherein the tissue sample is from a human. 

10. The method of any of claims 1 to 9 wherein the array comprises target elements at a density in the range of 100 to 
so 10,000 target elements per square centimetre. 

11. The method of any of claims 1 to 10 further comprising processing data from the detecting step (c) in a programmed 
computer, storing raw and processed data in a database and displaying raw and processed data. 

55 12. The method of any of claims 1 to 11 further comprising addition of unlabelled blocking nucleic acid. 

13. The method of any of claims 1 to 12 further comprising use of data derived from the method in selection of therapy 
for a human. 
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14. The method of any of claims 3 to 13 further comprising determining fluorescent ratios at each target element (i) 
between the first and third colours and (ii) between the second and third colours. 

15. The method of any of claims 1 to 14 wherein the tissue sample comprises a cell line sample 

5 

16. The method of any of claims 1 to 15 wherein the tissue sample comprises one cell. 

17. The method of any of claims 1 to 16 wherein the tissue sample comprises a human tumour sample. 

10 18. The method of any of claims 1 to 17 wherein the tissue sample comprises blood cells. 

19. The method of any of claims 5, 6 and 8 to 18 wherein the genomic DNAcomprises human genomic DNA having a 
complexity in a range of 20 kb to 250 kb. 

15 20. The method of any of claims 7 to 19 wherein the cDNA comprises cDNA having a complexity in a range of 100 bp 
to 5,000 bp. 

21. The method of any of claims 1 to 20 wherein the target nucleic acid elements comprise at least one peptide nucleic 
acid. 

20 

22. The method of any of claims 1 to 21 wherein the method is performed in a mesoscale device comprising a reaction 
chamber having a volume measured in amounts of 10" 8 or 10" 9 litres. 

23. The method of any of claims 1 to 22 wherein the array comprises at least 100 target elements. 

25 

24. The method of any of claims 1 to 23 wherein the array comprises at least 100 target elements on a planar surface 
of a substrate. 

25. The method of any of claims 1 to 24 wherein the chromosomal DNA population is produced by a method comprising 
30 PCR. 

26. The method of any of claims 1 to 25 wherein the tissue sample comprises a human blastomere cell or a human 
polar body. 

35 27. The method of any of claims 1 to 26 wherein the tissue sample is produced by microdissection. 

28. The method of any of claims 1 to 27 wherein the target nucleic acid elements comprise oligomers in the range of 8 
bp to about 100 bp. 

to 29. The method of any of claims 1 to 28 wherein the tissue sample comprises bladder, lung, prostate, breast, esophageal, 
cervical, ovarian, colon, brain, stomach, skin or pancreas tissue. 

30. The method of any of claims 1 to 29 comprising use of at least two reference nucleic acid populations. 

45 31. The method of any of claims 1 to 30 comprising use of at least four reference nucleic acid populations. 

32. The method of any of claims 1 to 31 wherein the tissue sample comprises a cancer cell line. 

33. The method of any of claims 1 to 32 wherein at least four separate fluorescently labelled nucleic acid populations 
50 are hybridised with the array. 

34. The method of any of claims 1 to 33 wherein at least eight separate fluorescently labelled nucleic acid populations 
are hybridised with the array. 

55 35. The method of any of claims 1 to 34, which further comprises: displaying at least one chromosome ideogram with 
array data. 
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Patentanspruche 

1 . Verfahren zum gleichzeitigen Nachweis von Genexpression und chromosomaler Abnormalitat in einer Gewebepro- 
be, umfassend: 

5 

(a) Bereitstellen einer Anordnung von an einen festen Trager gebundenen Nukleinsaurezielelementen, wobei 
die Nukleinsaurezielemente Polynukleotidsequenzen umfassen, die unter vorgewahlten Hybridisierungsbedin- 
gungen zu Nukleinsauren komplementar sind, wodurch Genexpression und chromosomale Sequenz einer 
Gewebeprobe angezeigt wird; 
10 (b) Bereitstellen von mindestens drei markierten Nukleinsaurepopulationen: 

(i) einer mRNA- Oder cDNA-Population, die mit einer ersten Markierung markiert ist und von einer Gewe- 
beprobe abgeleitet ist, 

(ii) einer chromosomalen DNA-Population, die mit einer zweiten Markierung markiert ist und von einer 
15 Gewebeprobe abgeleitet ist, und 

(iii) mindestens einer Bezugsnukleinsaurepopulation, die mit einer dritten Markierung markiert ist; 

(c) Inkontaktbringen der Anordnung mit den markierten Nukleinsaurepopulationen unter Hybridisierungsbedin- 
gungen; und 

20 (d) Nachweisen der Gegenwart und I ntensitat von jeder der ersten, zweiten und dritten Markierung an mindestens 

zwei Zielelementen. 
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2. Verfahren nach Anspruch 1 , ferner umfassend das Bestimmen von Verhaitnissen an jedem Zielelement (i) zwischen 
der ersten und der dritten Markierung und (ii) zwischen der zweiten und dritten Markierung. 

3. Verfahren nach Anspruch 1 Oder 2, wobei die erste, zweite und dritte Markierung jeweils eine unterschiedliche 
Fluoreszenzmarkierung umfassen. 

4. Verfahren zum gleichzeitigen Nachweis von Genexpression und chromosomaler Abnormalitat in einer Gewebepro- 
30 be, umfassend: 

(a) Bereitstellen einer Anordnung von an einen festen Trager gebundenen Nukleinsaurezielelementen, wobei 
die Nukleinsaurezielemente Polynukleotidsequenzen umfassen, die unter vorgewahlten Hybridisierungsbedin- 
gungen zu Nukleinsauren komplementar sind, wodurch Genexpression und chromosomale Sequenz einer 

35 Gewebeprobe angezeigt wird; 

(b) Bereitstellen von mindestens drei markierten Nukleinsaurepopulationen: 

(i) einer mRNA- Oder cDNA-Population, die mit einer ersten Fluoreszenzfarbe markiert ist und von einer 
Gewebeprobe abgeleitet ist, 

40 (ji) einer chromosomalen DNA-Population, die mit einer zweiten Fluoreszenzfarbe markiert ist und von 

einer Gewebeprobe abgeleitet ist, und 

(iii) mindestens einer Bezugsnukleinsaurepopulation, die mit einer dritten Fluoreszenzfarbe markiert ist; 

(c) Inkontaktbringen der Anordnung mit den markierten Nukleinsaurepopulationen unter Hybridisierungsbedin- 
45 gungen; und 

(d) Nachweisen der Gegenwart und Intensitat jeder der ersten, zweiten und dritten Fluoreszenzfarbe an min- 
destens zwei Zielelementen. 

5. Verfahren zum gleichzeitigen Nachweis von Genexpression und chromosomaler Abnormalitat in einer Gewebepro- 
50 be, umfassend: 

(a) Bereitstellen einer Anordnung von Nukleinsaurezielelementen, umfassend an einen festen Trager gebun- 
dene genomische DNA, wobei die Nukleinsaurezielemente Polynukleotidsequenzen umfassen, die unter vor- 
gewahlten Hybridisierungsbedingungen zu Nukleinsauren komplementar sind, wodurch Genexpression und 

55 chromosomale Sequenz einer Gewebeprobe angezeigt wird; 

(b) Bereitstellen von mindestens drei markierten Nukleinsaurepopulationen: 

(i) einer mRNA- Oder cDNA-Population, die mit einer ersten Fluoreszenzfarbe markiert ist und von einer 
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Gewebeprobe abgeleitet ist, 

(ii) einer chromosomalen DNA-Population, die mit einer zweiten Fluoreszenzfarbe markiert ist und von 
einer Gewebeprobe abgeleitet ist, und 

(iii) mindestens einer BezugsnukleinsSurepopulation, die mit einer dritten Fluoreszenzfarbe markiert ist; 
(c) Inkontaktbringen der Anordnung mit den markierten NukleinsSurepopulationen unter Hybridisierungs- 
bedingungen; und 

(d) Nachweisen der Gegenwart und Intensitat jeder der ersten, zweiten und dritten Fluoreszenzfarbe an min- 
destens zwei Zielelementen. 

6. Verfahren nach einem der Anspruche 1 bis 4, wobei die Zielelemente genomische DNA umfassen. 

7. Verfahren nach einem der Anspruche 1 bis 4, wobei die Zielelemente cDNA umfassen. 

8. Verfahren nach einem der Ansprilche 1 bis 4, wobei die Anordnung cDNA und genomische DNA umfasst. 

9. Verfahren nach einem der Anspruche 1 bis 8, wobei die Gewebeprobe vom Menschen stammt. 

10. Verfahren nach einem der Anspruche 1 bis 9, wobei die Anordnung Zielelemente mit einer Dichte im Bereich von 
100 bis 10.000 Zielelementen pro Quadratzentimeter umfasst. 

1 1 . Verfahren nach einem der Anspruche 1 bis 1 0, ferner umfassend das Verarbeiten von Daten aus dem Nachweisschritt 
(c) in einem programmierten Computer, Speichern von unverarbeiteten und verarbeiteten Daten in einer Datenbank 
und Anzeigen von unverarbeiteten und verarbeiteten Daten. 

12. Verfahren nach einem der Anspruche 1 bis 11, ferner umfassend die Zugabe einer unmarkierten blockierenden 
NukleinsSure. 

13. Verfahren nach einem der Anspruche 1 bis 12, ferner umfassend die Verwendung von dem Verfahren abgeleiteten 
Daten bei der Therapiewahl fur einen Menschen. 

14. Verfahren nach einem der Anspruche 3 bis 1 3, ferner umfassend das Bestimmen von FluoreszenzverhSltnissen an 
jedem Zielelement (i) zwischen der ersten und dritten Farbe und (ii) zwischen der zweiten und dritten Farbe. 

15. Verfahren nach einem der Anspruche 1 bis 14, wobei die Gewebeprobe eine Zellreihenprobe umfasst. 

16. Verfahren nach einem der Anspruche 1 bis 15, wobei die Gewebeprobe eine Zelle umfasst. 

17. Verfahren nach einem der Anspruche 1 bis 16, wobei die Gewebeprobe eine menschliche Tumorprobe umfasst. 

18. Verfahren nach einem der Anspruche 1 bis 17, wobei die Gewebeprobe Blutzellen umfasst. 

19. Verfahren nach einem der Anspruche 5, 6 und 8 bis 18, wobei die genomische DNA menschliche genomische DNA 
mit einer Komplexitat im Bereich von 20 kB bis 250 kB umfasst. 

20. Verfahren nach einem der Anspruche 7 bis 19, wobei die cDNA cDNA mit einer Komplexitat im Bereich von 100 
Bp bis 5.000 Bp umfasst. 

21. Verfahren nach einem der Anspruche 1 bis 20, wobei die ZielnukleinsSureelemente mindestens eine Peptidnukle- 
insSure umfassen. 

22. Verfahren nach einem der Anspruchel bis 21, wobei das Verfahren in einer Vorrichtung im Mesomafistab durch- 
gefOhrt wird, die eine Reaktionskammer mit einem Volumen umfasst, das in Mengen von 10" 8 oder 10 -9 Liter be- 
messen ist. 

23. Verfahren nach einem der Anspruche 1 bis 22, wobei die Anordnung mindestens 100 Zielelemente umfasst. 

24. Verfahren nach einem der Anspruche 1 bis 23, wobei die Anordnung mindestens 1 00 Zielelemente auf einer ebenen 
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Oberfiache eines Substrats umfasst. 

25. Verfahren nach einem der Anspruche 1 bis 24, wobei die chromosomale DNA-Population durch ein PCR umfas- 
sendes Verfahren hergestellt wird. 

26. Verfahren nach einem der Anspruche 1 bis 25, wobei die Gewebeprobe eine menschliche Blastomerzelle Oder ein 
menschliches PolkGrperchen umfasst. 

27. Verfahren nach einem der Anspruche 1 bis 26, wobei die Gewebeprobe durch Mikrosezieren hergestellt wird. 

28. Verfahren nach einem der Anspruche 1 bis 27, wobei die Zielnukleinsaureelemente Oligomere im Bereich von 8 
Bp bis etwa 100 Bp umfassen. 

29. Verfahren nach einem der Anspruche 1 bis 28, wobei die Gewebeprobe Blasen-, Lungen-, Prostata-, Brust-, Spei- 
serOhren-, Zervix-, Eierstock-, Damn-, Gehirn-, Magen-, Haut- Oder BauchspeicheldrQsengewebe umfasst. 

30. Verfahren nach einem der AnsprQche 1 bis 29, umfassend die Verwendung von mindestens zwei Bezugsnuklein- 
sSurepopulationen. 

31. Verfahren nach einem der Anspruche 1 bis 30,m umfassend die Verwendung von mindestens vier Bezugsnuklein- 
saurepopulationen. 

32. Verfahren nach einem der Anspruche 12 bis 31, wobei die Gewebeprobe eine Krebszellreihe umfasst. 

33. Verfahren nach einem der Anspruche 1 bis 32, wobei mindestens viergetrennte fluoreszierend markierte Nuklein- 
saurepopulationen mit der Anordnung hybridisiert werden. 

34. Verfahren nach einem der AnsprQche 1 bis 33, wobei mindestens acht getrennte fluoreszierend markierte Nukle- 
insSurepopulationen mit der Anordnung hybridisiert werden. 

35. Verfahren nach einem der AnsprQche 1 bis 34, ferner umfassend Anzeigen mindestens eines Chromosomenideo- 
gramms mit Anordnungsdaten. 

Revendications 

1 . Procede pour la detection simultanee d'une expression de genes et d'une anomalie chromosomique dans un echan- 
tillon tissulaire comprenant : 

(a) le fait de procurer une rangee d'elements cibles acides nucleiques attaches a un support solide dans lequel 
les elements cibles acides nucleiques comprennent des sequences polynucleotidiques substantiellement com- 
plementaires, dans des conditions preselectionnees d'hybridation, d'acides nucleiques revelateurs d'une ex- 
pression de genes et d'une sequence chromosomique d'un echantillon tissulaire; 

(b) le fait de procurer au moins trois populations marquees d'acides nucleiques : 

(i) une population d'ARNm ou d'ADNc marquee par un premier marqueur et provenant de I'echantillon 
tissulaire, 

(ii) une population d'ADN chromosomiques marqu6e par un deuxieme marqueur et provenant de I'echan- 
tillon tissulaire, et 

(iii) au moins une population d'acides nucleiques de reference marquee par un troisieme marqueur; 

(c) la mise en contact de la rang£e avec les populations marquees d'acides nucleiques dans des conditions 
d'hybridation; et 

(d) la detection de la presence et de I'intensite de chacun des premier, deuxieme et troisieme marqueurs sur 
au moins deux elements cibles. 

2. Proced6 suivant la revendication 1 , comprenant en outre la determination des rapports sur chaque element cible 
(i) entre les premier et troisieme marqueurs et (ii) entre les deuxieme et troisieme marqueurs. 
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3. Proc6d6 suivant la revendication 1 ou 2, 

dans lequel les premier, deuxieme et troisieme marqueurs comprennent chacun un marqueur fluorescent different. 

4. Procede pour la detection simultanee d'une expression de genes et d'une anomalie chromosomique dans un echan- 
5 tillon tissulaire comprenant : 

(a) le fait de procurer une rangee d'elements cibles acides nucleiques attaches a un support sotide dans lequel 
les elements cibles acides nucleiques comprennent des sequences polynucleotidiques substantiellementcom- 
plementaires, dans des conditions preselectionnees d'hybridation, d'acides nucleiques revelateurs d'une ex- 

10 pression de genes et d'une sequence chromosomique d'un echantillon tissulaire; 

(b) le fait de procurer au moins trois populations marquees d'acides nucleiques : 

(i) une population d'ARNm ou d'ADNc marquee par une premiere couleur fluorescente et provenant de 
I'echantillon tissulaire, 

is (jj) une population d'ADN chromosomiques marquee par une deuxieme couleur fluorescente et provenant 

de I'echantillon tissulaire, et 

(iii) au moins une population d'acides nucleiques de reference marquee par une troisieme couleur fluores- 
cente; 

20 (c) la mise en contact de la rang6e avec les populations marquees d'acides nucleiques dans des conditions 

d'hybridation; et 

(d) la detection de la presence et de I'intensite de chacune des premiere, deuxieme et troisieme couleurs 
fluorescente sur au moins deux elements cibles. 

25 5. Procede pour la detection simultanee d'une expression de genes et d'une anomalie chromosomique dans un 6chan- 
tillon tissulaire comprenant : 

(a) le fait de procurer une rang6e d'elements cibles acides nucleiques comprenant de t'ADN genomique attaches 
a un support solide dans lequel les elements cibles acides nucleiques comprennent des sequences polynu- 

30 cleotidiques substantiellement complementaires, dans des conditions preselectionnees d'hybridation, d'acides 

nucleiques revelateurs d'une expression de genes et d'une sequence chromosomique d'un echantillon tissulaire; 

(b) le fait de procurer au moins trois populations marquees d'acides nucleiques : 

(i) une population d'ARNm ou d'ADNc marquee par une premiere couleur fluorescente et provenant de 
35 I'echantillon tissulaire, 

(ii) une population d'ADN chromosomiques marquee par une deuxieme couleur fluorescente et provenant 
de I'echantillon tissulaire, et 

(iii) au moins une population d'acides nucleiques de reference marqu6e par une troisieme couleur fluores- 
cente; 

40 

(c) la mise en contact de la rang6e avec les populations marquees d'acides nucleiques dans des conditions 
d'hybridation; et 

(d) la detection de la presence et de I'intensite de chacune des premiere, deuxieme et troisieme couleurs 
fluorescentes sur au moins deux elements cibles. 

45 

6. Procede suivant Tune quelconque des revendications 1 a 4, dans lequel les elements cibles comprennent un ADN 
genomique. 

7. Procede suivant Tune quelconque des revendications 1 a 4, dans lequel les elements cibles comprennent un ADNc. 

50 

8. Procede suivant Tune quelconque des revendications 1 a 4, dans lequel la rangee comprend des elements cibles 
d'ADNc et d'ADN genomique. 

9. Procede suivant I'une quelconque des revendications 1 a 8, dans lequel I'echantillon tissulaire provient d'un §tre 
55 humain. 

10. Proc6d6 suivant I'une quelconque des revendications 1 a 9, dans lequel la rangee comprend des elements cibles 
a une densite dans la gamme de 100 a 10 000 elements cibles par centimetre carre. 
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11. Procede suivant Tune quelconque des revendications 1 a 10, comprenant en outre le traitement des donnees de 
I'etape de detection (c) dans un ordinateur programme, stockant les donnees brutes et traitees dans une base de 
donnees et presentant les donnees brutes et traitees. 

5 12. Precede suivant Tune quelconque des revendications 1 a 1 1 , comprenant en outre I'addition d'un acide nucleique 
bloquant non marque. 

13. Procede suivant Tune quelconque des revendications 1 a 1 2, comprenant en outre I'utilisation de donnees provenant 
du procede dans la selection d'une therapie pour un §tre humain. 

10 

14. Procede suivant Tune quelconque des revendications 3 a 13, comprenant en outre la determination des rapports 
de fluorescence sur chaque element cible (i) entre les premiere et troisieme couleurs et (ii) entre les deuxieme et 
troisieme couleurs. 

15 1 5. Procede suivant Tune quelconque des revendications 1 a 1 4, dans lequel I'echantillon tissulaire comprend un echan- 
tillon de lignee cellulaire. 

16. Procede suivant I'une quelconque des revendications 1 a 1 5, dans lequel I'echantillon tissulaire comprend une cellule. 

20 1 7. Procede suivant Tune quelconque des revendications 1 a 1 6, dans lequel I'echantillon tissulaire comprend un echan- 
tillon tumoral humain. 

18. Procede suivant Tune quelconque des revendications 1 a 17, dans lequel I'echantillon tissulaire comprend des 
cellules sanguines. 

25 

19. Procede suivant I'une quelconque des revendications 5, 6 et 8 a 18, dans lequel I'ADN genomique comprend un 
ADN genomique humain presentant une complexite dans un intervalle de 20 kb a 250 kb. 

20. Procede suivant I'une quelconque des revendications 7 a 19, dans lequel I'ADNc comprend un ADNc presentant 
30 une complexite dans une gamme de 100 pb a 5000 pb. 

21. Procede suivant I'une quelconque des revendications 1 a 20, dans lequel les elements cibles acides nucleiques 
comprennent au moins un acide nucleique peptidique. 

35 22. Procede suivant I'une quelconque des revendications 1 a 21 , dans lequel le procede est realise dans un dispositif 
d'echelle meso comprenant une chambre reactionnelle presentant un volume mesure en des quantites de I'ordre 
de 10- 8 ou 10- 9 litre. 

23. Procede suivant Tune quelconque des revendications 1 a 22, dans lequel la rangee comprend au moins 1 00 elements 
^o cibles. 

24. Procede suivant I'une quelconque des revendications 1 a 23, dans lequel la rangee comprend au moins 1 00 elements 
cibles sur une surface plane d'un substrat 

45 25. Procede suivant Tune quelconque des revendications 1 a 24, dans lequel la population d'ADN chromosomiques 
est produite par un procede comprenant une PCR. 

26. Procede suivant Tune quelconque des revendications 1 a 25, dans lequel I'echantillon tissulaire comprend un blas- 
tomere humain ou un globule polaire humain. 

50 

27. Procede suivant I'une quelconque des revendications 1 a 26, dans lequel I'echantillon tissulaire est produit par 
microdissection. 

28. Procede suivant I'une quelconque des revendications 1 a 27, dans lequel les elements cibles acides nucleiques 
55 comprennent des oligomeres dans la gamme de 8 pb a environ 1 00 pb. 

29. Procede suivant I'une quelconque des revendications 1 a 28, dans lequel I'echantillon tissulaire comprend du tissu 
de vessie, de poumon, de prostate, de sein, d'oesophage, de col de I'uterus, d'ovaire, de c6lon, de cerveau, d'es- 



33 



EP 1 026 260 B1 



tomac, de peau ou de pancreas. 

30. Procede suivant Tune quelconque des revendications 1 a 29, comprenant I'utilisation d'au moins deux populations 
d'acides nucleiques de reference. 

5 

31 . Precede suivant Tune quelconque des revendications 1 a 30, comprenant I'utilisation d'au moins quatre populations 
d'acides nucleiques de reference. 

32. Procede suivant Tune quelconque des revendications 1 a 31, dans lequel I'echantillon tissulaire comprend une 
10 lignee cellulaire cancereuse. 

33. Procede suivant Tune quelconque des revendications 1 a 32, dans lequel on realise une hybridation d'au moins 
quatre populations distinctes d'acides nucleiques marquees de maniere fluorescente avec la rangee. 

15 34. Procede suivant Tune quelconque des revendications 1 a 33, dans lequel on realise une hybridation d'au moins huit 
populations distinctes d'acides nucleiques marquees de maniere fluorescente avec la rangee. 

35. Procede suivant I'une quelconque des revendications 1 a 34, qui comprend en outre : le fait de montrer au moins 
un ideogramme chromosomique avec les donnees de rangee. 
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7.45 mm 10.5 mm 




Figure 1(a) 
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Top View 




Figure 1(e) 



36 



EP 1 026 260 B1 



o 




CM 




CO 




o 


CM 


CD 


o 




o 




< 


□ 



o 
r? 



cnj 



o 

CM 



to 



— f- 
o 



CO 



o 

CO 



CM 



o 

CM 



z 

Q 

O) 



O 



CD 
CNJ 

: o> 



37 



EP 1 026 260 B1 



00 



e 

i 

< 

z 

Q 

D> 

1 



< 

Z 
Q 

o 



CI 
3 



o 
cm 

<© o 

* 8 
-« □ 




CM 



38 



EP 1 026 260 B1 



E 

3 
X 

< 



Q 

< 

Q 

o 



in 



5 

E 

=3 
X 

z: 
o 



in 

< 
a 



□ 



in 



GO 
EO . 
to 

wao 

U-MV 

ONnr 

OWN 

VOGMId 

21NI 

nav 
sad 

"Bd 

2GID0S 
OAm) 

no 

UVH 
I-S3A 

noa 

2U3H 
M39 

VSVUH 

UNM 

SAW 

VHrTOOd 

i±£B 

9=IOQd 

qiun 

OKI 



d 



39 



EP 1 026 260 B1 




3 

rvf 

a 



40 



EP 1 026 260 B1 




CD 



m tj- co cnj 



41 



EP 1 026 260 B1 



3.5 



3 " 



2.5 " 



2 -■ 



1.5 



1 " 



0.5 



• m * \ • 



ABL 



1 1 1 1 1 \ 

0 0.5 1 1-5 2 2.5 3 3.5. 

i [CDNA k«2 ]/[gDNA HumanRef ] ~ - 



Figure 2(f) 
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Figure 2(h) 
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